ISOLATED NUCLEIC ACID MOLECULES ENCODING CANCER ASSOCIATED
ANTIGENS, THE ANTIGENS PER SE, AND USES THEREOF

RELATED APPLICATIONS 

This application is a continuation in part of Serial No. 09/602,362, filed June 22, 2000,
which is a continuation in part of Serial No. 09/451,739, filed November 30, 1999, both of which
are incorporated by reference in their entirety.

FIELD OF THE INVENTION 

This invention relates to antigens associated with cancer, the nucleic acid molecules 
encoding them, as well as the uses of these. 
BACKGROUND AND PRIOR ART 

It is fairly well established that many pathological conditions, such as infections, cancer,
autoimmune disorders, etc., are characterized by the inappropriate expression of certain
molecules. These molecules thus serve as "markers" for a particular pathological or abnormal
condition. Apart from their use as diagnostic "targets", i.e., materials to be identified to diagnose
these abnormal conditions, the molecules serve as reagents which can be used to generate
diagnostic and/or therapeutic agents. A by no means limiting example of this is the use of cancer
markers to produce antibodies specific to a particular marker. Yet another non-limiting example
is the use of a peptide which complexes with an MHC molecule, to generate cytolytic T cells
against abnormal cells.

Preparation of such materials, of course, presupposes a source of the reagents used to
generate these. Purification from cells is one laborious, far from sure method of doing so.
Another preferred method is the isolation of nucleic acid molecules which encode a particular
marker, followed by the use of the isolated encoding molecule to express the desired molecule.

Two basic strategies have been employed for the detection of such antigens, in e.g.,
human tumors. These will be referred to as the genetic approach and the biochemical approach.
The genetic approach is exemplified by, e.g., dePlaen et al., Proc. Natl. Acad. Sci. USA 85: 2275
(1988), incorporated by reference. In this approach, several hundred pools of plasmids of a
cDNA library obtained from a tumor are transfected into recipient cells, such as COS cells, or
into antigen-negative variants of tumor cell lines which are tested for the expression of the
specific antigen. The biochemical approach, exemplified by, e.g., O. Mandelboim, et al., Nature
369: 69 (1994) incorporated by reference, is based on acidic elution of peptides which have
bound to MHC-class I molecules of tumor cells, followed by reversed-phase high performance
liquid chromatography (HPLC). Antigenic peptides are identified after they bind to empty MHC-
class I molecules of mutant cell lines, defective in antigen processing, and induce specific
reactions with cytotoxic T-lymphocytes. These reactions include induction of CTL proliferation,
TNF release, and lysis of target cells, measurable in an MTT assay, or a 51Cr release assay.

These two approaches to the molecular definition of antigens have the following 
disadvantages: first, they are enormously cumbersome, time-consuming and expensive; and
second, they depend on the establishment of cytotoxic T cell lines (CTLs) with predefined 
specificity. 

The problems inherent to the two known approaches for the identification and molecular
definition of antigens are best demonstrated by the fact that both methods have, so far, succeeded
in defining only very few new antigens in human tumors. See, e.g., van der Bruggen et al.,
Science 254: 1643-1647 (1991); Brichard et al., J. Exp. Med. 178: 489-495 (1993); Coulie, et
al., J. Exp. Med. 180: 35-42 (1994); Kawakami, et al., Proc. Natl. Acad. Sci. USA 91: 3515-
3519 (1994).

Further, the methodologies described rely on the availability of established, permanent
cell lines of the cancer type under consideration. It is very difficult to establish cell lines from
certain cancer types, as is shown by, e.g., Oettgen, et al., Immunol. Allerg. Clin. North. Am.
10: 607-637 (1990). It is also known that some epithelial cell type cancers are poorly susceptible
to CTLs in vitro, precluding routine analysis. These problems have stimulated the art to develop
additional methodologies for identifying cancer associated antigens.

One key methodology is described by Sahin, et al., Proc. Natl. Acad. Sci. USA 92:
11810-11813 (1995), incorporated by reference. Also, see U.S. Patent No. 5,698,396, and
Application Serial No. 08/479,328, filed on June 7, 1995 and January 3, 1996, respectively. All
three of these references are incorporated by reference. To summarize, the method involves the
expression of cDNA libraries in a prokaryotic host. (The libraries are secured from a tumor
sample.) The expressed libraries are then immunoscreened with absorbed and diluted sera, in
order to detect those antigens which elicit high titer humoral responses. This methodology is
known as the SEREX method ("Serological identification of antigens by Recombinant
Expression Cloning"). The methodology has been employed to confirm expression of previously
identified tumor associated antigens, as well as to detect new ones. See the above referenced
patent applications and Sahin, et al., supra, as well as Crew, et al., EMBO J 14: 2333-2340
(1995).

This methodology has been applied to a range of tumor types, including those described
by Sahin et al., supra, and Pfreundschuh, supra, as well as to esophageal cancer (Chen et al.,
Proc. Natl. Acad. Sci. USA 94: 1914-1918 (1997)); lung cancer (Gure et al., Cancer Res. 58:
1034-1041 (1998)); colon cancer (Serial No. 08/948,705 filed October 10, 1997) incorporated
by reference, and so forth. Among the antigens identified via SEREX are the SSX2 molecule
(Sahin et al., Proc. Natl. Acad. Sci. USA 92: 11810-11813 (1995); Tureci et al., Cancer Res. 56:
4766-4772 (1996)); NY-ESO-1 (Chen, et al., Proc. Natl. Acad. Sci. USA 94: 1914-1918 (1997));
and SCP1 (Serial No. 08/892,705 filed July 15, 1997) incorporated by reference. Analysis of
SEREX identified antigens has shown overlap between SEREX defined and CTL defined
antigens. MAGE-1, tyrosinase, and NY-ESO-1 have all been shown to be recognized by patient
antibodies as well as CTLs, showing that humoral and cell mediated responses do act in concert.

It is clear from this summary that identification of relevant antigens via SEREX is a
desirable aim. The inventors have applied this methodology and have identified several new
antigens associated with cancer, as detailed in the description which follows.
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

EXAMPLE 1

The SEREX methodology, as described by, e.g., Sahin, et al., Proc. Natl. Acad. Sci. USA
92: 11810-11813 (1995); Chen, et al., Proc. Natl. Acad. Sci. USA 94: 1914-1918 (1997), and
U.S. Patent No. 5,698,396, all of which are incorporated by reference, was used. In brief, total
RNA was extracted from a sample of a cutaneous metastasis of a breast cancer patient (referred
to as "BR11" hereafter), using standard CsCl guanidine thiocyanate gradient methodologies. A
cDNA library was then prepared, using commercially available kits designed for this purpose.
Following the SEREX methodology referred to supra, this cDNA expression library was
amplified, and screened with either autologous BR11 serum which had been diluted to 1:200, or
with allogeneic, pooled serum, obtained from 7 different breast cancer patients, which had been
diluted to 1:1000. To carry out the screen, serum samples were first diluted to 1:10, and then
preabsorbed with lysates of E. coli that had been transfected with naked vector, and the serum
samples were then diluted to the levels described supra. The final dilutions were incubated
overnight at room temperature with nitrocellulose membranes containing phage plaques, at a
density of 4-5000 plaque forming units ("pfus") per 130 mm plate.
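
The two-step serum dilution described above (a 1:10 pre-dilution, preabsorption, then dilution to the final level) can be checked with simple arithmetic. The following is a minimal sketch only; the function name is an illustrative assumption and is not part of the described protocol.

```python
from fractions import Fraction


def remaining_dilution(pre_dilution: int, final_dilution: int) -> Fraction:
    """Return the additional fold-dilution needed after a pre-dilution step.

    Both arguments are the N in a "1:N" dilution.
    """
    if pre_dilution <= 0 or final_dilution < pre_dilution:
        raise ValueError("final dilution must be at least the pre-dilution")
    return Fraction(final_dilution, pre_dilution)


# Serum is first diluted 1:10, preabsorbed, then brought to its final level.
autologous_step = remaining_dilution(10, 200)    # autologous BR11 serum, final 1:200
allogeneic_step = remaining_dilution(10, 1000)   # pooled allogeneic serum, final 1:1000
print(autologous_step, allogeneic_step)
```

Under these figures the preabsorbed autologous serum would need a further 20-fold dilution, and the pooled serum a further 100-fold dilution.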

Nitrocellulose filters were washed, and incubated with alkaline phosphatase conjugated,
goat anti-human Fcγ secondary antibodies, and reactive phage plaques were visualized via
incubation with 5-bromo-4-chloro-3-indolyl phosphate and nitroblue tetrazolium.

This procedure was also carried out on a normal testicular cDNA library, using a 1:200
serum dilution. 

A total of 1.12x10^ pfus were screened in the breast cancer cDNA library, and 38 positive
clones were identified. With respect to the testicular library, 4x10^ pfus were screened, and 28
positive clones were identified.

Additionally, 8x10^ pfus from the BR11 cDNA library were screened using the pooled
serum described. Of these, 23 were positive.

The positive clones were subcloned, purified, and excised to forms suitable for insertion
in plasmids. Following amplification of the plasmids, DNA inserts were evaluated via restriction
mapping (EcoRI-XbaI), and clones which represented different cDNA inserts were sequenced
using standard methodologies.

If sequences were identical to sequences found in GenBank, they were classified as
known genes, while sequences which shared identity only with ESTs, or were identical to
nothing in these data bases, were designated as unknown genes. Of the clones from the breast
cancer library which were positive with autologous serum, 3 were unknown genes. Of the
remaining 35, 15 were identical to either NY-ESO-1, or SSX2, two known members of the CT
antigen family described supra; the remaining clones corresponded to 14 known genes.
Of the testicular library, 12 of the clones were SSX2.
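
The classification rule and the clone tally for the autologous screen can be expressed compactly. This is an illustrative sketch; the function and variable names are assumptions introduced here, and only the counts (38, 3 and 15) come from the text.

```python
def classify_clone(genbank_identical: bool, est_identical: bool = False) -> str:
    """Apply the rule stated in the text to one sequenced clone.

    A clone identical to a GenBank entry is a known gene. A clone that
    matches only ESTs, or matches nothing, is an unknown gene, so the
    EST flag does not change the outcome.
    """
    return "known" if genbank_identical else "unknown"


# Tally for the breast cancer library screened with autologous serum.
total_positive = 38
unknown_clones = 3
ct_antigen_clones = 15          # NY-ESO-1 or SSX2
other_known_clones = total_positive - unknown_clones - ct_antigen_clones
print(other_known_clones)
```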

The NY-ESO-1 antigen was not found, probably because the commercial library that was
used had been size fractionated to have an average length of 1.5 kilobases, which is larger than
full length NY-ESO-1 cDNA which is about 750 base pairs long.

With respect to the screening carried out with pooled, allogeneic sera, four of the clones
were NY-ESO-1. No other CT antigens were identified. With the exception of NY-ESO-1, all
of the genes identified were expressed universally in normal tissue.

A full listing of the isolated genes, and their frequency of occurrence follows, in tables
1, 2 and 3. Two genes were found in both the BR11 and testicular libraries, i.e., poly (ADP-
ribose) polymerase, and tumor suppressor gene ING1. The poly (ADP-ribose) polymerase gene
has also been found in colon cancer libraries screened via SEREX, as is disclosed by Scanlan,
et al., Int. J. Cancer 76: 652-58 (1998). When the genes identified in the screening of the BR11
cDNA library by autologous and allogeneic sera were compared, NY-ESO-1 and human keratin
were found in both.

Table 1. SEREX-defined genes identified by autologous screening of BR11 cDNA library

Gene group     No. of clones   Comments                             Expression
CT genes       10              NY-ESO-1                             tumor, testis
               5               SSX2                                 tumor, testis
Non-CT genes   5               Nuclear Receptor Co-Repressor        ubiquitous
               4               Poly(ADP-ribose) polymerase          ubiquitous
               2               Adenylosuccinate lyase               ubiquitous
               2               cosmid 313 (human)                   ESTs: muscle, brain, breast
               1               CD151 (transmembrane protein)        ubiquitous
                               Human HRY gene                       RT-PCR: multiple normal tissues
                               Alanyl-tRNA-Synthetase               ubiquitous
                               NAD(+) ADP-Ribosyltransferase        ubiquitous
                               Human keratin 10                     ESTs: multiple normal tissues
                               Human EGFR kinase substrate          ubiquitous
                               ING1 tumor suppressor gene           RT-PCR: multiple normal tissues
                               Unknown gene,                        ESTs: pancreas, liver, spleen, uterus
                               Na_CX3AP_Prl2 cDNA clone
                               Unknown gene                         ESTs: multiple normal tissues
                               Unknown gene                         RT-PCR: multiple normal tissues



Table 2. SEREX-defined genes identified by allogeneic screening of BR11 cDNA library

Gene group     No. of clones   Comments                             Expression
CT genes       4               NY-ESO-1                             tumor, testis
Non-CT genes   6               zinc-finger helicase                 ESTs: brain, fetal heart, total fetus
               4               Acetoacetyl-CoA-thiolase             ubiquitous
               3               KIAA0330 gene                        ESTs: multiple normal tissues
               2               U1snRNP                              ubiquitous
               1               Human aldolase A                     ubiquitous
               1               Retinoblastoma binding protein 6     ESTs: tonsils, fetal brain,
                                                                    endothelial cells, brain
               1               α2-Macroglobulin receptor            ubiquitous
                               associated protein
               1               Human Keratin 10                     ESTs: multiple normal tissues



Table 3. SEREX-defined genes identified by screening of a testicular cDNA library with BR11 serum

Gene group     No. of clones   Comments                             Expression
CT genes:      12              SSX2                                 tumor, testis
Non-CT genes:  3               Rho-associated coiled-coil           ubiquitous
                               forming protein
               3               Poly(ADP-ribose) polymerase          ubiquitous
               3               Gene from HeLa cell, similar to      ubiquitous
                               TITIN
               2               Gene from parathyroid tumor          RT-PCR: multiple normal tissues
                               Transcription termination factor     ubiquitous
                               I-interacting peptide 21
                               Gene from fetal heart                ESTs: multiple normal tissues
                               ING1 tumor suppressor gene           RT-PCR: multiple normal tissues
                               KIAA0647 cDNA                        ESTs: multiple normal tissues
                               KIAA0667 cDNA                        ESTs: multiple normal tissues



EXAMPLE 2

The mRNA expression pattern of the cDNAs identified in example 1, in both normal and
malignant tissues, was studied. To do this, gene specific oligonucleotide primers were designed
which would amplify cDNA segments 300-600 base pairs in length, using a primer melting
temperature of 65-70°C. The primers used for amplifying MAGE-1, 2, 3 and 4, BAGE, NY-
ESO-1, SCP1, and SSX1, 2, 3, 4 and 5 were known primers, or were based on published
sequences. See Chen, et al., supra; Tureci, et al., Proc. Natl. Acad. Sci. USA 95: 5211-16 (1998);
Gure, et al., Int. J. Cancer 72: 965-71 (1997); Chen, et al., Proc. Natl. Acad. Sci. USA 91: 1004-
1008 (1994); Gaugler, et al., J. Exp. Med. 179: 921-930 (1994); dePlaen, et al., Immunogenetics
40: 360-369 (1994), all of which are incorporated by reference. RT-PCR was carried out for 35
amplification cycles, at an annealing temperature of 60°C. Using this RT-PCR assay, the breast
cancer tumor specimen was positive for a broad range of CT antigens, including MAGE-1, 3
and 4, BAGE, SSX2, NY-ESO-1 and CT7. The known CT antigens SCP-1, SSX1, 4 and 5
were not found to be expressed.

An additional set of experiments were carried out, in which the seroreactivity of patient
sera against tumor antigens was tested. Specifically, ELISAs were carried out, in accordance with
Stockert, et al., J. Exp. Med. 187: 1349-1354 (1998), incorporated by reference, to determine
if antibodies were present in the patient sera. Assays were run for MAGE-1, MAGE-3, NY-
ESO-1, and SSX2. The ELISAs were positive for NY-ESO-1 and SSX2, but not the two MAGE
antigens.

EXAMPLE 3

Two clones (one from the breast cancer cDNA library and one from the testicular library)
were identified as a gene referred to as ING1, which is a tumor suppressor gene candidate. See
Garkavtsev, et al., Nature 391: 295-8 (1998), incorporated by reference. The sequence found
in the breast cancer library differed from the known sequence of ING1 at six residues, i.e.,
positions 818, 836, 855, 861, 866 and 874. The sequence with the six variants is set forth at SEQ
ID NO: 1. The sequence of wild type ING1 is set out at SEQ ID NO: 2.

To determine if any of these differences represented a mutation in tumors, a short PCR
fragment which contained the six positions referred to supra was amplified from a panel of
allogeneic normal tissue, subcloned, amplified, and sequenced following standard methods.

The results indicated that the sequences in the allogeneic tissues were identical to what
was found in tumors, ruling out the hypothesis that the sequence differences were a tumor
associated mutation. This conclusion was confirmed, using the testicular library clone, and using
restriction analysis of ING1 cDNA taken from normal tissues. One must conclude, therefore,
that the sequence information provided by Garkavtsev, et al., supra, is correct.

EXAMPLE 4

Additional experiments were carried out to determine whether genetic variations might
exist in the 5' portion of the ING1 gene, which might differ from the 5' portion of the clone
discussed supra (SEQ ID NO: 1). In a first group of experiments, attempts were made to obtain
full length ING1 cDNA from both the breast tumor library, and the testicular library. SEQ ID
NO: 1 was used as a probe of the library, using standard methods.

Four clones were isolated from the testicular library and none were isolated from the
breast cancer library. The four clones, following sequencing, were found to derive from three
transcript variants. The three variants were identical from position 586 down to their 3' end, but
differed in their 5' regions, suggesting alternatively spliced variants, involving the same exon-
intron junction. All three differed from the sequence of ING1 described by Garkavtsev, et al.,
in Nat. Genet. 14: 415-420 (1996). These three variants are set out as SEQ ID NOS: 1, 3 and 4.

All of the sequences were then analyzed. The ORFs of SEQ ID NOS: 2, 1 and 4 (SEQ
ID NO: 2 is the originally disclosed ING1 sequence), encode polypeptides of 294, 279 and 235
amino acids, of which 233 are encoded by the 3' region common to the three sequences. These
putative sequences are set out as SEQ ID NOS: 19, 5, and 7. With respect to SEQ ID NO: 3,
however, no translational initiation site could be identified in its 5' region.

EXAMPLE 5 

The data regarding SEQ ID NO: 3, described supra, suggested further experiments to find
additional ORFs in the 5'-end of variant transcripts of the molecule. In order to determine this,
5'-RACE-PCR was carried out using gene specific and adapter specific primers, together with
commercially available products, and standard methodologies.

The primers used for these experiments were:

CACACAGGATCCATGTTGAGTCCTGCCAACGG

CGTGGTCGTGGTTGCTGGACGCG
(SEQ ID NOS: 9 and 10), for SEQ ID NO: 1;

CCCAGCGGCCCTGACGCTGTC

CGTGGTCGTGGTTGCTGGACGCG
(SEQ ID NOS: 11 and 12), for SEQ ID NO: 3; and

GGAAGAGATAAGGCCTAGGGAAG

CGTGGTCGTGGTTGCTGGACGCG
(SEQ ID NOS: 13 and 14), for SEQ ID NO: 4.

Cloning and sequencing of the products of RACE PCR showed that the variant sequence
of SEQ ID NO: 4 was 5' to SEQ ID NO: 3, and that full length cDNA for the variant SEQ ID
NO: 3 contained an additional exon 609 nucleotides long, positioned between SEQ ID NO: 3 and
the shared, 3' sequence referred to supra. This exon did not include an ORF. The first available
initiation site would be an initial methionine at amino acid 70 of SEQ ID NO: 1. Thus, if
expressed, SEQ ID NO: 3 would correspond to a molecule with a 681 base pair, untranslated 5'
end and a region encoding 210 amino acids (SEQ ID NO: 6).
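
The polypeptide lengths quoted for the ING1 variants are mutually consistent, as the short check below illustrates. It is a sketch under the assumption that residue positions are 1-based and inclusive; the function and variable names are introduced here for illustration only.

```python
def residues_from_internal_start(full_length: int, start_residue: int) -> int:
    """Length of a product that begins at an internal methionine.

    Positions are 1-based, so starting at residue 70 of a 279-residue
    polypeptide leaves residues 70..279.
    """
    return full_length - start_residue + 1


# SEQ ID NO: 3 product: first available methionine is residue 70 of the
# 279-residue polypeptide encoded by SEQ ID NO: 1.
seq3_product = residues_from_internal_start(279, 70)

# Residues unique to each variant, given the 233 residues encoded by the
# shared 3' region (polypeptides of SEQ ID NOS: 2, 1 and 4).
shared = 233
unique_n_terminal = {
    seq_id: length - shared
    for seq_id, length in {2: 294, 1: 279, 4: 235}.items()
}
print(seq3_product, unique_n_terminal)
```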

EXAMPLE 6 

The presence of transcript variants with at least 3 different transcriptional initiation sites,
and possibly different promoters, suggested that mRNA expression might be under different,
tissue specific regulation.

To determine this, variant-specific primers were synthesized, and RT-PCR was carried
out on a panel of tissues, using standard methods.

SEQ ID NO: 1 was found to be expressed universally in all of the normal breast, brain
and testis tissues examined, in six breast cancer lines, and 8 melanoma cell lines, and in cultured
melanocytes. SEQ ID NO: 3 was found to be expressed in four of the six breast cancer lines,
normal testis, liver, kidney, colon and brain. SEQ ID NO: 4 was only found to be expressed by
normal testis cells and weakly in brain cells.

EXAMPLE 7

A further set of experiments were carried out to determine if antibodies against ING1
were present in sera of normal and cancer patients. A phage plaque immunoassay of the type
described supra was carried out, using clones of SEQ ID NO: 1 as target. Of 14 allogeneic sera
taken from breast cancer patients, two were positive at 1:200 dilutions. All normal sera were
negative.

EXAMPLE 8

The BR11 cDNA library described supra was then screened, using SEQ ID NO: 1 and
standard methodologies. A 593 base pair cDNA was identified, which was different from any
sequences in the data banks consulted. The sequence of this cDNA molecule is set out at SEQ
ID NO: 8.

The cDNA molecule set forth as SEQ ID NO: 1 was then used in Southern blotting
experiments. In brief, genomic DNA was isolated from normal human tissue, digested with
BamHI or HindIII, and then separated on 0.7% agarose gel, blotted onto nitrocellulose filters,
and hybridized using labelled SEQ ID NO: 1, at high stringency conditions (aqueous buffer,
65°C). The probes were permitted to hybridize overnight, and then exposed for autoradiography.
Two hybridizing DNA species were identified, i.e., SEQ ID NOS: 1 and 8.

EXAMPLE 9 

The cDNA molecule set forth in SEQ ID NO: 8 was then analyzed. 5'-RACE-PCR was
carried out using normal fetus cDNA. Full length cDNA for the molecule is 771 base pairs long,
without the poly A tail. It shows strong homology to SEQ ID NO: 1, with the strongest
homology in the 5' two-thirds (76% identity over nucleotide 1-480); however, the longest ORF
is only 129 base pairs, and would encode a polypeptide 42 amino acids long which was
homologous to, but much shorter than, the expected expression product of SEQ ID NO: 1.

In addition to the coding region, SEQ ID NO: 8 contains 203 base pairs of 5'-untranslated
region, and 439 base pairs of 3'-untranslated region.
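
The segment lengths given for SEQ ID NO: 8 add up to the stated full length, and the ORF length matches the stated polypeptide length if the 129 base pairs are taken to include the stop codon. The sketch below makes that assumption explicit; the helper name is hypothetical.

```python
def orf_to_residues(orf_bp: int, includes_stop: bool = True) -> int:
    """Convert an ORF length in base pairs to a count of encoded residues.

    Assumes (this is not stated in the text) that the quoted ORF length
    includes the terminating stop codon when includes_stop is True.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of three")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


five_prime_utr, orf, three_prime_utr = 203, 129, 439
total_length = five_prime_utr + orf + three_prime_utr
print(total_length, orf_to_residues(orf))
```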

RT-PCR assays were carried out, as described supra. All of the normal tissues tested,
including brain, colon, testis, tissue and breast, were positive for expression of this gene. Eight
melanoma cell lines were tested, of which seven showed varying levels of expression, and one
showed no expression. Six breast cancer cell lines were tested, of which four showed various
levels of expression, and two showed no expression.

EXAMPLE 10 

An additional breast cancer cDNA library, referred to as "BR17-128", was screened,
using autologous sera. A cDNA molecule was identified.

Analysis of the sequence suggested that it was incomplete at the 5' end. To extend the
sequence, a testicular cDNA library was screened with a nucleotide probe based upon the partial
sequence identified in the breast cancer library. An additional 1200 base pairs were identified
following these screenings. The 2011 base pairs of information are set forth in SEQ ID NO: 15.

The longest open reading frame is 1539 base pairs, corresponding to a protein of about
59.15 kilodaltons. The deduced sequence is set forth at SEQ ID NO: 16.

RT-PCR was then carried out using the following primers: 

CACACAGGATCCATGCAGGCCCCGCACAAGGAG 
CACACAAAGCTTCTAGGATTTGGCACAGCCAGAG 
(SEQ ID NOS: 17 and 18) 
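
Both primers carry a short 5' tail ahead of the gene-specific sequence. The text does not comment on these tails, but a simple scan shows that they contain the standard recognition sequences of BamHI (GGATCC) and HindIII (AAGCTT). The sketch below is illustrative; the function name and the choice of enzymes to scan for are assumptions.

```python
# Standard recognition sequences for two common restriction enzymes.
SITES = {"BamHI": "GGATCC", "HindIII": "AAGCTT"}


def find_sites(primer: str, sites: dict = SITES) -> dict:
    """Return 0-based start positions of each recognition site in a primer."""
    primer = primer.upper()
    hits = {}
    for enzyme, motif in sites.items():
        positions = [
            i for i in range(len(primer) - len(motif) + 1)
            if primer.startswith(motif, i)
        ]
        if positions:
            hits[enzyme] = positions
    return hits


forward = "CACACAGGATCCATGCAGGCCCCGCACAAGGAG"    # SEQ ID NO: 17
reverse = "CACACAAAGCTTCTAGGATTTGGCACAGCCAGAG"   # SEQ ID NO: 18
print(find_sites(forward), find_sites(reverse))
```

Whether these sites were used for directional cloning of the amplified product is not stated in the text and should not be inferred from this scan alone.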

Strong signals were observed in normal testis and breast tissue, and weak expression was 
observed in placenta. 

No expression was found in normal brain, kidney, liver, colon, adrenal, fetal brain, lung,
pancreas, prostate, thymus, uterus, and ovary tissue. Of tumor cell lines tested, 2 of the breast
cancer lines were strongly positive and two were weakly positive. Of melanoma, two of 8 were
strongly positive, and 3 were weakly positive. Of lung cancer cell lines, 4 of 15 were strongly
positive, and 3 were weakly positive.

When cancer tissue specimens were tested, 16 of 25 breast cancer samples were strongly
positive, and 3 additional samples were weakly positive. Two of 36 melanoma samples were
positive (one strong, one weak). All other cancer tissue samples were negative.

When Northern blotting was carried out, a high molecular weight smear was observed
in testis, but in no other tissues tested.

EXAMPLE 11

Further experiments were carried out using the tumor sample referred to in example 10,
supra. This sample was derived from a subcutaneous metastasis of a 60 year old female breast
cancer patient. Total RNA was extracted, as described supra. Following the extraction, a cDNA
library was constructed in λ-ZAP expression vectors, also as described supra. Screening was
carried out, using the protocol set forth in example 1. A total of 7 x 10^ pfus were screened.
Fourteen reactive clones were identified, purified, and sequenced. The sequences were then
compared to published sequences in GenBank and EST databases. These analyses indicated that
the clones were derived from seven distinct genes, two of which were known, and five unknown.
The two known genes were "TBK-1" (three clones), and TI-227 (one clone). These are
universally expressed genes, with the libraries referred to supra showing ESTs for these genes
from many different tissues.

With respect to the remaining 10 clones, six were derived from the same gene, referred
to hereafter as "NY-BR-1." Three cDNA sequences were found in the EST database which
shared identity with the gene. Two of these (AI 951118 and AW 373574) were identified as
being derived from a breast cancer library, while the third (AW 170035), was from a pooled
tissue source.

EXAMPLE 12 

The distribution of the new gene NY-BR-1 referred to supra was determined via RT-PCR.
In brief, gene specific oligonucleotide NY-BR-1 primers were designed to amplify cDNA
segments 300-600 base pairs in length, with primer melting temperatures estimated at 65-70°C.
The RT-PCR was then carried out over 30 amplification cycles, using a thermal cycler, and an
annealing temperature of 60°C. Products were analyzed via 1.5% gel electrophoresis, and
ethidium bromide visualization. Fifteen normal tissues (adrenal gland, fetal brain, lung,
mammary gland, pancreas, placenta, prostate, thymus, uterus, ovary, brain, kidney, liver, colon
and testis) were assayed. The NY-BR-1 clone gave a strong signal in mammary gland and testis
tissue, and a very faint signal in placenta. All other tissues were negative. The other clones were
expressed universally, based upon comparison to information in the EST database library, and
were not pursued further.

The expression pattern of NY-BR-1 in cancer samples was then tested, by carrying out
RT-PCR, as described supra, on tumor samples.

In order to determine the expression pattern, primers:
caaagcagag cctcccgaga ag

(SEQ ID NO: 20) and

cctatgctgc tcttcgattc ttcc

(SEQ ID NO: 21) were used.
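
The text does not say how primer melting temperatures were estimated. As a rough illustration only, the sketch below applies the Wallace rule (2°C per A or T, 4°C per G or C), which is a coarse approximation for short oligonucleotides and may differ from whatever method was actually used; the function names are assumptions.

```python
def base_counts(primer: str) -> tuple:
    """Return (A+T count, G+C count) for a primer, ignoring spaces."""
    seq = primer.replace(" ", "").upper()
    at = sum(seq.count(base) for base in "AT")
    gc = sum(seq.count(base) for base in "GC")
    if at + gc != len(seq):
        raise ValueError("primer contains characters other than A, C, G, T")
    return at, gc


def wallace_tm(primer: str) -> int:
    """Wallace-rule melting temperature estimate in degrees Celsius."""
    at, gc = base_counts(primer)
    return 2 * at + 4 * gc


seq_id_20 = "caaagcagag cctcccgaga ag"
seq_id_21 = "cctatgctgc tcttcgattc ttcc"
for name, primer in (("SEQ ID NO: 20", seq_id_20), ("SEQ ID NO: 21", seq_id_21)):
    print(name, base_counts(primer), wallace_tm(primer))
```

Because the rule is so coarse, a value a degree or two outside a design window says little about the primers themselves.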

Of twenty-five breast cancer samples tested, twenty two were positive for NY-BR-1. Of these,
seventeen gave strong signals, and five gave weak to modest signals.

An additional 82 non-mammary tumor samples were also analyzed, divided into 36
melanoma, 26 non small cell lung cancer, 6 colon cancer, 6 squamous cell carcinoma, 6
transitional cell carcinoma, and two leiomyosarcomas. Only two melanoma samples were
positive for NY-BR-1 expression.

The study was then extended to expression of NY-BR-1 in tissue culture. Cell lines
derived from breast tumor, melanoma, and small cell lung cancer were studied. Four of six
breast cancer cells were positive (two were very weak), four of eight melanoma (two very weak),
and seven of fourteen small cell lung cancer lines (two very weak) were positive.

EXAMPLE 13 

In order to determine the complete cDNA molecule for NY-BR-1, the sequences of the
six clones referred to supra were compiled, to produce a nucleotide sequence 1464 base pairs
long. Analysis of the open reading frame showed a continuous ORF throughout, indicating that
the compiled sequence is not complete.

Comparison of the compiled sequence with the three EST library sequences referred to
supra allowed for extension of the sequence. The EST entry AW170035 (446 base pairs long)
overlapped the compiled sequence by 89 base pairs at its 5' end, permitting extension of the
sequence by another 357 base pairs. A translational terminal codon was identified in this way,
leading to a molecule with a 3'-untranslated region 333 base pairs long. The 5' end of the
molecule was lacking, however, which led to the experiment described infra.
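
Extending a compiled sequence with an overlapping EST amounts to a suffix-prefix merge. The sketch below illustrates the idea on short made-up sequences (the real sequences are not reproduced here); the function name, the toy sequences and the minimum-overlap threshold are all assumptions.

```python
def extend_by_overlap(contig: str, est: str, min_overlap: int = 5) -> tuple:
    """Extend contig at its 3' end using an EST whose 5' end overlaps it.

    Returns (merged_sequence, overlap_length). If no exact overlap of at
    least min_overlap bases is found, the contig is returned unchanged.
    """
    longest = min(len(contig), len(est))
    for k in range(longest, min_overlap - 1, -1):
        if contig.endswith(est[:k]):
            return contig + est[k:], k
    return contig, 0


# Toy example with hypothetical sequences.
toy_contig = "ATGGCCATTGTAATGGGCC"
toy_est = "ATGGGCCGCTGAAAGGG"
merged, overlap = extend_by_overlap(toy_contig, toy_est)
print(merged, overlap)

# Length bookkeeping with the figures given in the text.
est_length, overlap_length, compiled_length = 446, 89, 1464
extension = est_length - overlap_length
print(extension, compiled_length + extension)
```

With the figures from the text, the 446 base pair EST contributes 357 new base pairs beyond its 89 base pair overlap, bringing the compiled sequence to 1821 base pairs.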

EXAMPLE 14 

In order to determine the missing, 5' end of the clone described supra, a commercially
available testis cDNA expression library was screened, using a PCR expression product of the
type described supra as a probe. In brief, 5x10^ pfus per 150 mm plate were transferred to
nitrocellulose membranes, which were then submerged in denaturation solution (1.5 M NaCl and
0.5 M NaOH), transferred to neutralization solution (1.5 M NaCl and 0.5 M Tris-HCl), and then
rinsed with 0.2 M Tris-HCl, and 2xSSC. Probes were labelled and hybridization was
carried out at high stringency conditions (i.e., 68°C, aqueous buffer). Any positive clones were
subcloned, purified, and in vivo excised to plasmid pBK-CMV, as described supra.

One of the clones identified in this way included an additional 1346 base pairs at the 5'
end; however, it was not a full length molecule. A 5'-RACE-PCR was carried out, using
commercially available products. The PCR product was cloned into plasmid vector pGEMT and
sequenced. The results indicated that cDNA sequence was extended 1292 base pairs further, but
no translation initiation site could be determined, because no stop codons could be detected. It
could be concluded, however, that the cDNA of the NY-BR17 clone comprises at least 4026
nucleotides, which are presented as SEQ ID NO: 22. The molecule, as depicted, encodes a
protein at least about 152.8 kDa in molecular weight. Structurally, there are 99 base pairs 5' to
the presumed translation initiation site, and an untranslated segment 333 base pairs long at the
3' end. The predicted amino acid sequence of the coding region for SEQ ID NO: 22 is set out
at SEQ ID NO: 23.

SEQ ID NO: 23 was analyzed for motifs, using the known search programs PROSITE
and Pfam. A bipartite nuclear localization signal motif was identified at amino acids 17-34,
suggesting that the protein is a nuclear protein. Five tandem ankyrin repeats were identified, at
amino acids 49-81, 82-114, 115-147, 148-180 and 181-213. A bZIP site (i.e., a DNA binding
site followed by a leucine zipper motif) was found at amino acid positions 1077-1104, suggesting
a transcription factor function. It was also observed that three repetitive elements were identified
in between the ankyrin repeats and the bZIP DNA binding site. To elaborate, a repetitive
element 117 nucleotides long is tandemly repeated 3 times, between amino acids 459-815. The
second repetitive sequence, consisting of 11 amino acids, repeats 7 times between amino acids
224 and 300. The third repetitive element, 34 amino acids long, is repeated twice, between
amino acids 301-368.
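
The coordinates quoted for the ankyrin repeats and for the second and third repetitive elements can be checked for internal consistency. The sketch below assumes 1-based, inclusive residue coordinates; the helper names are introduced here for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive interval."""
    return end - start + 1


def fills_span(start: int, end: int, unit: int, copies: int) -> bool:
    """True if `copies` tandem units of length `unit` exactly fill the span."""
    return span_length(start, end) == unit * copies


ankyrin_repeats = [(49, 81), (82, 114), (115, 147), (148, 180), (181, 213)]
ankyrin_lengths = [span_length(s, e) for s, e in ankyrin_repeats]
# Tandem repeats should abut: each starts one residue after the previous ends.
ankyrin_contiguous = all(
    nxt[0] == prev[1] + 1 for prev, nxt in zip(ankyrin_repeats, ankyrin_repeats[1:])
)

second_element_ok = fills_span(224, 300, unit=11, copies=7)
third_element_ok = fills_span(301, 368, unit=34, copies=2)
print(ankyrin_lengths, ankyrin_contiguous, second_element_ok, third_element_ok)
```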

EXAMPLE 15

The six clones described supra were compared, and analysis revealed that they were
derived from two different splice variants. Specifically, two clones, referred to as "BR17-8" and
"BR17-44a", contain one more exon, of 111 base pairs (nucleotides 3015-3125 of SEQ ID NO:
22), which encodes amino acids 973-1009 of SEQ ID NO: 23, than do clones BR17-1a, BR17-
35b and BR17-44b. The shortest of the six clones, BR17-128, starts 3' to the additional exon.
The key structural elements referred to supra were present in both splice variants, suggesting that
there was no difference in biological function.
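
The additional exon has a length that is a multiple of three, so its presence or absence would not be expected to shift the downstream reading frame. A minimal check, assuming inclusive coordinates and using illustrative names:

```python
def inclusive_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


exon_nt = inclusive_length(3015, 3125)   # nucleotides of SEQ ID NO: 22
exon_aa = inclusive_length(973, 1009)    # residues of SEQ ID NO: 23
frame_preserving = exon_nt % 3 == 0 and exon_nt // 3 == exon_aa
print(exon_nt, exon_aa, frame_preserving)
```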

The expression pattern of the two splice variants was assessed via RT-PCR, using
primers which spanned the 111 base pair exon referred to supra.

The primers used were: 

aatgggaaca agagctctgc ag 

(SEQ ID NO: 24) and 

gggtcatctg aagttcagca ttc 

(SEQ ID NO: 25) 
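
Basic properties of the two primers can be computed directly from the sequences as printed. The sketch below is illustrative only: the helper names are assumptions, and the text does not state which primer is the sense primer, so no orientation is assumed.

```python
# Primer sequences as printed (spaces are formatting only).
PRIMER_SEQ_ID_24 = "aatgggaaca agagctctgc ag"
PRIMER_SEQ_ID_25 = "gggtcatctg aagttcagca ttc"

_COMPLEMENT = str.maketrans("acgt", "tgca")


def clean(seq):
    """Remove whitespace and normalise to lower case."""
    return "".join(seq.split()).lower()


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written with a, c, g, t."""
    return clean(seq).translate(_COMPLEMENT)[::-1]


def gc_fraction(seq):
    """Fraction of bases that are g or c."""
    s = clean(seq)
    return (s.count("g") + s.count("c")) / len(s)


if __name__ == "__main__":
    for name, primer in (("SEQ ID NO: 24", PRIMER_SEQ_ID_24),
                         ("SEQ ID NO: 25", PRIMER_SEQ_ID_25)):
        print(name, len(clean(primer)), round(gc_fraction(primer), 2),
              reverse_complement(primer))
```

The reverse complement is the form in which an antisense primer would appear when read along the sense strand of SEQ ID NO: 22.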

Both variants were expressed strongly in normal testis and breast. The longer variant was
dominant in testis, and the shorter variant in breast cells. When breast cancer cells were tested,
co-typing of the variant was observed (7 strongly positive, 2 weakly positive, and 1 negative), with the
shorter variant being the predominant form consistently.

EXAMPLE 16

The frequency of antibody response against NY-BR-1 in breast cancer patients was
tested. To do this, a recombinant protein consisting of amino acids 993-1188 of SEQ ID NO:
23 was prepared. (This is the protein encoded by clone BR17-128, referred to supra.) A total
of 140 serum samples were taken from breast cancer patients, as were 60 normal serum samples.
These were analyzed via Western blotting, using standard methods.

Four of the cancer sera samples were positive, including a sample from patient BR17.
All normal sera were negative.

An additional set of experiments was then carried out to determine if sera recognized the
portion of NY-BR-1 protein with repetitive elements. To do this, a different recombinant
protein, consisting of amino acids 405-1000, was made, and tested in Western blot assays. None
of the four antibody positive sera reacted with this protein, indicating that an antibody epitope is
located in the non-repetitive, carboxy terminal end of the molecule.

EXAMPLE 17 

The screening of the testicular cDNA library referred to supra resulted, inter alia, in the
identification of a cDNA molecule that was homologous to NY-BR-1. The molecule is 3673
base pairs in length, excluding the poly A tail. This corresponded to nucleotides 1-3481 of SEQ
ID NO: 22, and showed 62% homology thereto. No sequence identity to sequences in libraries
was noted. ORF analysis identified an ORF from nucleotide 641 through the end of the
sequence, with 54% homology to the protein sequence of SEQ ID NO: 23. The ATG initiation
codon of this sequence is 292 base pairs further 3' to the presumed initiation codon of NY-BR-1,
and is preceded by 640 untranslated base pairs at its 5' end. This 640 base pair sequence includes
scattered stop codons. The nucleotide sequence and deduced amino acid sequence are presented
as SEQ ID NOS: 26 and 27, respectively.

RT-PCR analysis was carried out in the same way as is described supra, using primers:

tctcatagat gctggtgctg atc

(SEQ ID NO: 28) and

cccagacatt gaattttggc agac

(SEQ ID NO: 29).

Tissue restricted mRNA expression was found. The expression pattern differed from that of SEQ
ID NO: 22. In brief, of six normal tissues examined, strong signals were found in brain and testis
only. There was no or weak expression in normal breast tissues, and kidney, liver and colon
tissues were negative. Eight of the 10 breast cancer specimens tested supra were positive for
SEQ ID NO: 26. Six samples were positive for both SEQ ID NO: 22 and 26, one for SEQ ID
NO: 22 only, two for SEQ ID NO: 26 only, and one was negative for both.
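
The co-expression counts reported above can be tabulated as follows. This is a minimal sketch: the specimen-level outcomes are reconstructed from the stated totals only, and the function and key names are assumptions introduced for illustration.

```python
from collections import Counter

# One entry per breast cancer specimen:
# (positive for SEQ ID NO: 22, positive for SEQ ID NO: 26).
# Reconstructed from the counts stated in the text, not from raw data.
OUTCOMES = (
    [(True, True)] * 6
    + [(True, False)] * 1
    + [(False, True)] * 2
    + [(False, False)] * 1
)


def summarize(outcomes):
    """Tally specimens by expression category."""
    counts = Counter(outcomes)
    return {
        "total": len(outcomes),
        "both": counts[(True, True)],
        "seq22_only": counts[(True, False)],
        "seq26_only": counts[(False, True)],
        "neither": counts[(False, False)],
        "seq26_positive": sum(1 for _, pos26 in outcomes if pos26),
    }


if __name__ == "__main__":
    print(summarize(OUTCOMES))
```

The four categories account for all 10 specimens, and the specimens positive for SEQ ID NO: 26 (both, plus SEQ ID NO: 26 only) sum to the eight reported.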

EXAMPLE 18 

Recently, a working draft of the human genome sequence was released. This database
was searched, using standard methods, and NY-BR-1 was found to have sequence identity with
at least three chromosome 10 clones, identified by Genbank accession numbers AL157387,
AL37148, and AC067744. These localize NY-BR-1 to chromosome 10 p11.21-12.1.

The comparison of NY-BR-1 and the human genomic sequence led to definition of
NY-BR-1 exon-intron organization. In brief, the coding region of the gene contains essentially 19
structurally distinct exons with at least 2 exons encoding 3' untranslated regions. Detailed
exon-intron junction information is described at Genbank AF269081.

The six ankyrin repeats, referred to supra, are all found within exon 7. The 357
nucleotide repeating unit is composed of exons 10-15. The available genomic sequences are not
complete, however, and only one of the three copies was identified, suggesting that DNA
sequences between exons 5 and 10 may be duplicated and inserted in tandem, during genetic
evolution. In brief, when the isolated NY-BR-1 cDNA clone was analyzed, three complete and
one incomplete copy of the repeating units are present. The exon sequences can be expressed as
exons 1-2-3-4-5-6-7-8-9-(10-11-12-13-14-15)-(10A-11A-12A-13A-14A-15A)-(10B-11B-12B-
13B-14B-15B)-(10C-11C-12C-13C-14C)-16-17-18-19-20-21, wherein A, B & C are inexact
copies of exon 10-15 sequences. Cloned NY-BR-1 cDNA has 38 exons in toto.
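
The exon order set out above can be enumerated to confirm the stated total. The following is a minimal sketch; the function name is an assumption, and the layout is taken to be exons 1-9, one original and two complete inexact copies (A, B) of exons 10-15, one incomplete copy (C) of exons 10-14, and exons 16-21.

```python
def build_exon_order():
    """Return the exon labels of the cloned cDNA in 5' to 3' order."""
    exons = [str(n) for n in range(1, 10)]              # exons 1-9
    exons += [str(n) for n in range(10, 16)]            # original exons 10-15
    for suffix in ("A", "B"):                           # two complete inexact copies
        exons += [f"{n}{suffix}" for n in range(10, 16)]
    exons += [f"{n}C" for n in range(10, 15)]           # incomplete copy, 10C-14C
    exons += [str(n) for n in range(16, 22)]            # exons 16-21
    return exons


if __name__ == "__main__":
    order = build_exon_order()
    print("-".join(order))
    print("total exons:", len(order))
```

Counted this way the cloned cDNA has 38 exons, of which 21 belong to the basic framework, i.e., the 19 structurally distinct coding exons plus at least 2 exons encoding 3' untranslated regions.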

It was noted, supra, that the sequence of NY-BR-1 cDNA was not complete at the 5' end.
Genomic sequence (Genbank AC067744) permitted extension of the 5' end. Translation of the
5' genomic sequence led to the identification of a new translation initiation site, 168 base pairs
upstream of the previously predicted ATG initiation codon. This led to an NY-BR-1 polypeptide
1397 amino acids long, 56 residues of which are added at the N-terminus, compared
to prior sequence information, i.e.:
MEEISAAAVK^A^PERPSPFSQLWrSND
(SEQ ID NO: 30).

EXAMPLE 20 

Reference was made, supra, to the two different splice variants of NY-BR-1.
Comparison of the splice variants with the genomic sequence confirmed an alternate splicing
event, with the longer variant incorporating part of intron 33 into exon 34 (i.e., exon 17 of the
basic exon/intron framework described supra).

Key structural elements that were predicted in NY-BR-1, described supra, are present in
both variants, suggesting that there is no difference in biological function, or subcellular location.

EXAMPLE 21 

As with NY-BR-1, the variant NY-BR-1.1, described supra, was screened against the
working draft of the human genome sequence. One clone was found with sequence identity, i.e.,
GenBank AL359312, derived from chromosome 9. Thus, NY-BR-1 and NY-BR-1.1 both appear
to be functioning genes, on two different chromosomes. The Genbank sequence referred to
herein does not contain all of NY-BR-1.1, which precludes defining exon-intron structure.
Nonetheless, at least 3 exons can be defined, which correspond to exons 16-18 of the NY-BR-1
basic framework. Exon-intron junctions are conserved.

EXAMPLE 22 

A series of peptides were synthesized, based upon the amino acid sequence of NY-BR-1,
as set forth in SEQ ID NO: 23. These were then tested for their ability to bind to HLA-A2
molecules and to stimulate CTL proliferation, using an ELISPOT assay. This assay involved
coating 96-well, flat bottom nitrocellulose plates with 5 ug/ml of anti-interferon gamma
antibodies in 100 ul of PBS per well, followed by overnight incubation. Purified CD8+ cells,
which had been separated from PBL samples via magnetic beads coated with anti-CD8
antibodies, were then added, at 1x10^ cells/well, in RPMI 1640 medium, that had been
supplemented with 10% human serum, L-asparagine (50 mg/l), L-arginine (242 mg/l),
L-glutamine (300 mg/l), together with IL-2 (2.5 ng/ml), in a final volume of 100 ul. CD8+ effector
cells were prepared by presensitizing with peptide, and were then added at from 5x10^ to 2x10*
cells/well. Peptides were pulsed onto irradiated T2 cells at a concentration of 10 ug/ml for 1 hour,
washed and added to effector cells, at 5x10^ cells/well. The plates were incubated for 16 hours
at 37°C, washed six times with 0.05% Tween 20/PBS, and were then supplemented with
biotinylated, anti-interferon gamma specific antibody at 0.5 ug/ml. After incubation for 2 hours
at 37°C, plates were washed, and developed with commercially available reagents, for 1 hour,
followed by 10 minutes of incubation with dye substrate. Plates were then prepped for counting,
positives being indicated by blue spots. The number of blue spots/well was determined as the
frequency of NY-ESO-1 specific CTLs/well.

Experiments were run, in triplicate, and total number of CTLs was calculated. As
controls, one of reagents alone, effector cells alone, or antigen presenting cells alone were used.
The difference between the number of positives in stimulated versus non-stimulated cells was
calculated as the effective number of peptide specific CTLs above background. Three peptides
were found to be reactive, i.e.:

LLSHGAVIEV (amino acids 102-111 of SEQ ID NO: 23)

SLSKILDTV (amino acids 904-912 of SEQ ID NO: 23)

SLDQKLFQL (amino acids 1262-1270 of SEQ ID NO: 23). 
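
The background subtraction described above can be sketched as follows. The spot counts are hypothetical values chosen for illustration, and averaging the triplicate wells before subtraction is an assumption; the text states only that experiments were run in triplicate and that the difference between stimulated and non-stimulated cells was taken.

```python
from statistics import mean


def specific_spots(stimulated_wells, unstimulated_wells):
    """Mean spots in peptide-stimulated wells minus mean spots in control wells.

    Averaging the replicate wells is an assumption made for this sketch.
    """
    return mean(stimulated_wells) - mean(unstimulated_wells)


if __name__ == "__main__":
    # Hypothetical triplicate spot counts, not data from the experiments.
    stimulated = [30, 34, 32]
    unstimulated = [4, 6, 5]
    print("peptide specific spots above background:",
          specific_spots(stimulated, unstimulated))
```

A peptide would be scored as reactive when this difference is clearly above zero; the threshold used for that decision is not given in the text.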



The complete list of peptides tested, with reference to their position in SEQ ID NO: 23,
follows:

Peptide            Position

FLVDRKVCQL         35-43
nJDSGADI           68-76
AV X OXilLrOV      90-98
TT CA/\/AVT T      95-103
LLSHGAVIEV         102-111
VT T OTT/^ AX7T    101-109
FLLIKNANA          134-142
MLLQQNVDV          167-175
GMLLQQNVDV         166-175
LLQQNVDVFA         168-177
IAWEKKETPV         361-370
SLFBSSAKI          430-438
CIPENSIYQKV        441-450
KVMEINREV          449-457
ELMDMQTBKA         687-696
ELMDMQTFKA         806-815
SLSKILDTV          904-912
KILDTVHSC          907-915
DLNEKIREEL         987-996
RIQDIELKSV         1018-1027
YLLHENCML          1043-1051
CMLKKEIAML         1049-1058
AMLKLELATL         1056-1065
KBLKEKNAEL         1081-1090
VLiAnNlML.
CXQRKMNVDV         1174-1183
KMNVDVSST          1178-1186
SLDQKLFQL          1262-1270
KLFQLQSKNM         1266-1275
QLQSKNMWL          1269-1277
NMWLQQQLV          1274-1282
WLQQQLVHA          1276-1284
KmDlHFL            1293-1301
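
Entries in a peptide list of this kind can be checked for internal consistency, i.e., that each peptide length matches its stated position range and that overlapping peptides agree at shared positions. The sketch below is illustrative only; the function names are assumptions, and only a subset of clearly legible rows is used.

```python
# A subset of legible rows from the list above: (peptide, position range).
ROWS = [
    ("MLLQQNVDV", "167-175"),
    ("GMLLQQNVDV", "166-175"),
    ("LLQQNVDVFA", "168-177"),
    ("SLSKILDTV", "904-912"),
    ("KILDTVHSC", "907-915"),
    ("SLDQKLFQL", "1262-1270"),
    ("KLFQLQSKNM", "1266-1275"),
]


def parse_range(position):
    """Split 'start-end' into two integers."""
    start, end = position.split("-")
    return int(start), int(end)


def length_matches(peptide, position):
    """True when the peptide length equals the size of the stated range."""
    start, end = parse_range(position)
    return len(peptide) == end - start + 1


def find_conflicts(rows):
    """Return (position, first residue seen, conflicting residue) tuples."""
    seen = {}
    conflicts = []
    for peptide, position in rows:
        start, _ = parse_range(position)
        for offset, residue in enumerate(peptide):
            pos = start + offset
            if seen.setdefault(pos, residue) != residue:
                conflicts.append((pos, seen[pos], residue))
    return conflicts


if __name__ == "__main__":
    for peptide, position in ROWS:
        print(peptide, position, length_matches(peptide, position))
    print("conflicts:", find_conflicts(ROWS))
```

A check of this type flags rows such as FLVDRKVCQL at 35-43, where a ten residue peptide is listed against a nine position range.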



The foregoing examples describe the isolation of a nucleic acid molecule which encodes
a cancer associated antigen. "Associated" is used herein because while it is clear that the relevant
molecule was expressed by several types of cancer, other cancers, not screened herein, may also
express the antigen.

The invention relates to nucleic acid molecules which encode the antigens encoded by,
e.g., SEQ ID NOS: 1, 3, 8, 15, 22 and 26 as well as the antigens encoded thereby, such as the
proteins with the amino acid sequences of SEQ ID NOS: 5, 6, 7, 16, 23, 27, and 30. It is to be
understood that all sequences which encode the recited antigen are a part of the invention.

Also a part of the invention are proteins, polypeptides, and peptides, which comprise, e.g.,
at least nine consecutive amino acids found in SEQ ID NO: 23, or at least nine consecutive
amino acids of the amino acids of SEQ ID NO: 30. Proteins, polypeptides and peptides
comprising nine or more amino acids of SEQ ID NO: 5, 6, 7, 16 or 27 are also a part of the
invention. Especially preferred are peptides comprising or consisting of amino acids 102-111,
904-912, or 1262-1270 of SEQ ID NO: 23. Such peptides may, but do not necessarily, provoke
CTL responses when complexed with an HLA molecule, such as an HLA-A2 molecule. They
may also bind to different MHC or HLA molecules, including, but not being limited to, HLA-A1,
A2, A3, B7, B8, Cw3, Cw6, or serve, e.g., as immunogens, as part of immunogenic cocktail
compositions, where they are combined with other proteins or polypeptides, and so forth. Also
a part of the invention are the nucleic acid molecules which encode these molecules, such as
"minigenes," expression vectors that include the coding regions, recombinant cells containing
these, and so forth. All are a part of the invention.

Also a part of the invention are expression vectors which incorporate the nucleic acid
molecules of the invention, in operable linkage (i.e., "operably linked") to a promoter.
Construction of such vectors, such as viral (e.g., adenovirus or Vaccinia virus) or attenuated viral
vectors is well within the skill of the art, as is the transformation or transfection of cells, to
produce eukaryotic cell lines, or prokaryotic cell strains which encode the molecule of interest.
Exemplary of the host cells which can be employed in this fashion are COS cells, CHO cells,
yeast cells, insect cells (e.g., Spodoptera frugiperda), NIH 3T3 cells, and so forth. Prokaryotic
cells, such as E. coli and other bacteria may also be used. Any of these cells can also be
transformed or transfected with further nucleic acid molecules, such as those encoding cytokines,
e.g., interleukins such as IL-2, 4, 6, or 12 or HLA or MHC molecules.

Also a part of the invention are the antigens described herein, both in original form and
in any different post translational modified forms. The molecules are large enough to be
antigenic without any posttranslational modification, and hence are useful as immunogens, when
combined with an adjuvant (or without it), in both precursor and post-translationally modified
forms. Antibodies produced using these antigens, both poly and monoclonal, are also a part of
the invention as well as hybridomas which make monoclonal antibodies to the antigens. The
whole protein can be used therapeutically, or in portions, as discussed infra. Also a part of the
invention are antibodies against this antigen, be these polyclonal, monoclonal, reactive
fragments, such as Fab, F(ab')2 and other fragments, as well as chimeras, humanized antibodies,
recombinantly produced antibodies, and so forth.

As is clear from the disclosure, one may use the proteins and nucleic acid molecules of
the invention diagnostically. The SEREX methodology discussed herein is premised on an
immune response to a pathology associated antigen. Hence, one may assay for the relevant
pathology via, e.g., testing a body fluid sample of a subject, such as serum, for reactivity with
the antigen per se. Reactivity would be deemed indicative of possible presence of the pathology.
So, too, could one assay for the expression of any of the antigens via any of the standard nucleic
acid hybridization assays which are well known to the art, and need not be elaborated upon
herein. One could assay for antibodies against the subject molecules, using standard
immunoassays as well.

Analysis of SEQ ID NO: 1, 3, 4, 8, 15, 22 and 26 will show that there are 5' and 3' non- 
coding regions presented therein. The invention relates to those isolated nucleic acid molecules 
which contain at least the coding segment, and which may contain any or all of the non-coding 
5' and 3' portions. 

Also a part of the invention are portions of the relevant nucleic acid molecules which can
be used, for example, as oligonucleotide primers and/or probes, such as one or more of SEQ ID
NOS: 9, 10, 11, 12, 13, 14, 17, 18, 20, 21, 24, 25, 28, and 29 as well as amplification products
like nucleic acid molecules comprising at least nucleotides 305-748 of SEQ ID NO: 1, or
amplification products described in the examples, including those in examples 12, 14, etc.

As was discussed supra, study of other members of the "CT" family reveals that these are
also processed to peptides which provoke lysis by cytolytic T cells. There has been a great deal
of work on motifs for various MHC or HLA molecules, which is applicable here. Hence, a
further aspect of the invention is a therapeutic method, wherein one or more peptides derived
from the antigens of the invention which bind to an HLA molecule on the surface of a patient's
tumor cells are administered to the patient, in an amount sufficient for the peptides to bind to the
MHC/HLA molecules, and provoke lysis by T cells. Any combination of peptides may be used.
These peptides, which may be used alone or in combination, as well as the entire protein or
immunoreactive portions thereof, may be administered to a subject in need thereof, using any of
the standard types of administration, such as intravenous, intradermal, subcutaneous, oral, rectal,
and transdermal administration. Standard pharmaceutical carriers, adjuvants, such as saponins,
GM-CSF, and interleukins and so forth may also be used. Further, these peptides and proteins
may be formulated into vaccines with the listed material, as may dendritic cells, or other cells
which present relevant MHC/peptide complexes.

Similarly, the invention contemplates therapies wherein nucleic acid molecules which
encode the proteins of the invention, or one or more peptides which are derived from these
proteins, are incorporated into a vector, such as a Vaccinia or adenovirus based vector, to render
it transfectable into eukaryotic cells, such as human cells. Similarly, nucleic acid molecules
which encode one or more of the peptides may be incorporated into these vectors, which are then
the major constituent of nucleic acid based therapies.

Any of these assays can also be used in progression/regression studies. One can monitor
the course of abnormality involving expression of these antigens simply by monitoring levels of
the protein, its expression, antibodies against it and so forth using any or all of the methods set
forth supra.

It should be clear that these methodologies may also be used to track the efficacy of a
therapeutic regime. Essentially, one can take a baseline value for a protein of interest using any
of the assays discussed supra, administer a given therapeutic agent, and then monitor levels of
the protein thereafter, observing changes in antigen levels as indicia of the efficacy of the regime.

WO 01/47959 PCT/US00/42334

As was indicated supra, the invention involves, inter alia, the recognition of an
"integrated" immune response to the molecules of the invention. One ramification of this is the
ability to monitor the course of cancer therapy. In this method, which is a part of the invention,
a subject in need of the therapy receives a vaccination of a type described herein. Such a
vaccination results, e.g., in a T cell response against cells presenting HLA/peptide complexes on
their cells. The response also includes an antibody response, possibly a result of the release of
antibody provoking proteins via the lysis of cells by the T cells. Hence, one can monitor the
effect of a vaccine, by monitoring an antibody response. As is indicated, supra, an increase in
antibody titer may be taken as an indicia of progress with a vaccine, and vice versa. Hence, a
further aspect of the invention is a method for monitoring efficacy of a vaccine, following
administration thereof, by determining levels of antibodies in the subject which are specific for
the vaccine itself, or a large molecule of which the vaccine is a part.

The identification of the subject proteins as being implicated in pathological conditions
such as cancer also suggests a number of therapeutic approaches in addition to those discussed
supra. The experiments set forth supra establish that antibodies are produced in response to
expression of the protein. Hence, a further embodiment of the invention is the treatment of
conditions which are characterized by aberrant or abnormal levels of one or more of the proteins,
via administration of antibodies, such as humanized antibodies, antibody fragments, and so forth.
These may be tagged or labelled with appropriate cytostatic or cytotoxic reagents.

T cells may also be administered. It is to be noted that the T cells may be elicited in vitro
using immune responsive cells such as dendritic cells, lymphocytes, or any other immune
responsive cells, and then reperfused into the subject being treated.

Note that the generation of T cells and/or antibodies can also be accomplished by
administering cells, preferably treated to be rendered non-proliferative, which present relevant
T cell or B cell epitopes for response, such as the epitopes discussed supra.

The therapeutic approaches may also include antisense therapies, wherein an antisense
molecule, preferably from 10 to 100 nucleotides in length, is administered to the subject either
"neat" or in a carrier, such as a liposome, to facilitate incorporation into a cell, followed by
inhibition of expression of the protein. Such antisense sequences may also be incorporated into
appropriate vaccines, such as in viral vectors (e.g., Vaccinia), bacterial constructs, such as
variants of the known BCG vaccine, and so forth.

Other features and applications of the invention will be clear to the skilled artisan, and
need not be set forth herein. The terms and expression which have been employed are used as
terms of description and not of limitation, and there is no intention in the use of such terms and
expression of excluding any equivalents of the features shown and described or portions thereof,
it being recognized that various modifications are possible within the scope of the invention.

We claim:

1. Isolated nucleic acid molecule which encodes a cancer associated antigen, whose
amino acid sequence is identical to the amino acid sequence encoded by the nucleotide sequence
of SEQ ID NO: 1, 3, 4, 8, 15, 19, 22, or 26.

2. The isolated nucleic acid molecule of claim 1, comprising the nucleotide sequence
of SEQ ID NO: 1.

3. The isolated nucleic acid molecule of claim 1, comprising the nucleotide sequence
of SEQ ID NO: 3.

4. The isolated nucleic acid molecule of claim 1, comprising the nucleotide sequence
of SEQ ID NO: 4.

5. The isolated nucleic acid molecule of claim 1, comprising the nucleotide sequence
of SEQ ID NO: 8.

6. The isolated nucleic acid molecule of claim 1, comprising the nucleotide sequence
of SEQ ID NO: 15.

7. The isolated nucleic acid molecule of claim 1, comprising the nucleotide sequence
of SEQ ID NO: 19.

8. The isolated nucleic acid molecule of claim 1, comprising the nucleotide sequence
of SEQ ID NO: 22.

9. The isolated nucleic acid molecule of claim 1, comprising the nucleotide sequence
of SEQ ID NO: 26.

10. Expression vector comprising the isolated nucleic acid molecule of claim 1,
operably linked to a promoter.

11. Eukaryotic cell line or prokaryotic cell strain, transformed or transfected with the
expression vector of claim 10.

12. Isolated cancer associated antigen comprising all or part of the amino acid
sequence encoded by SEQ ID NO: 1, 3, 4, 8, 15, 19, 22 or 26.

13. Eukaryotic cell line or prokaryotic cell strain, transformed or transfected with the
isolated nucleic acid molecule of claim 1.

14. The eukaryotic cell line or prokaryotic cell strain of claim 13, wherein said cell
line is also transfected with a nucleic acid molecule coding for a cytokine.

15. The eukaryotic cell line or prokaryotic cell strain of claim 14, wherein said cell 
line is further transfected by a nucleic acid molecule coding for an MHC molecule. 

16. The eukaryotic cell line or prokaryotic cell strain of claim 14, wherein said
cytokine is an interleukin.

17. The eukaryotic cell line or prokaryotic cell strain of claim 16, wherein said 
interleukin is IL-2, IL-4 or IL-12. 

18. The eukaryotic cell line or prokaryotic cell strain of claim 13, wherein said cell 
line has been rendered non-proliferative. 

19. The eukaryotic cell line of claim 13, wherein said cell line is a fibroblast cell line. 

20. Expression vector comprising a mutated or attenuated virus and the isolated 
nucleic acid molecule of claim 1. 

21. The expression vector of claim 20, wherein said virus is adenovirus or vaccinia
virus.

22. The expression vector of claim 21, wherein said virus is vaccinia virus.

23. The expression vector of claim 21, wherein said virus is adenovirus.

24. Expression system useful in transfecting a cell, comprising (i) a first vector
containing a nucleic acid molecule which codes for the isolated cancer associated antigen of
claim 13 and (ii) a second vector selected from the group consisting of (a) a vector containing
a nucleic acid molecule which codes for an MHC or HLA molecule which presents an antigen
derived from said cancer associated antigen and (b) a vector containing a nucleic acid molecule
which codes for an interleukin.

25. Immunogenic composition comprising the isolated cancer antigen of claim 12,
and a pharmaceutically acceptable adjuvant.

26. The immunogenic composition of claim 25, wherein said adjuvant is a cytokine,
a saponin, or GM-CSF.

27. Immunogenic composition comprising at least one peptide consisting of an amino
acid sequence of from 8 to 12 amino acids concatenated to each other in the isolated cancer
associated cancer antigen of claim 12, and a pharmaceutically acceptable adjuvant.

28. The immunogenic composition of claim 27, wherein said adjuvant is a saponin,
a cytokine, or GM-CSF.

29. The immunogenic composition of claim 25, wherein said composition comprises
a plurality of peptides which complex with a specific MHC molecule.

30. Immunogenic composition which comprises at least one expression vector which
encodes a peptide derived from the amino acid sequence encoded by SEQ ID NO: 1, 3, 4, 8, 15,
19, 22 or 26.

31. The immunogenic composition of claim 30, wherein said at least one expression
vector codes for a plurality of peptides.

32. Vaccine useful in treating a subject afflicted with a cancerous condition
comprising the isolated eukaryotic cell line of claim 13 and a pharmacologically acceptable
adjuvant.

33. The vaccine of claim 32, wherein said eukaryotic cell line has been rendered
non-proliferative.

34. The vaccine of claim 33, wherein said eukaryotic cell line is a human cell line. 

35. A composition of matter useful in treating a cancerous condition comprising a
non-proliferative cell line having expressed on its surface a peptide derived from the amino acid
sequence encoded by SEQ ID NO: 1, 3, 4, 8, 15, 19, 22 or 26.

36. The composition of matter of claim 35, wherein said cell line is a human cell line.

37. A composition of matter useful in treating a cancerous condition, comprising (i)
a peptide derived from the amino acid sequence encoded by SEQ ID NO: 1, 3, 4, 8, 15, 19, 22
or 26, (ii) an MHC or HLA molecule, and (iii) a pharmaceutically acceptable carrier.

38. Isolated antibody which is specific for the cancer associated antigen of claim 12. 

39. The isolated antibody of claim 38, wherein said antibody is a monoclonal 
antibody. 

40. Method for screening for cancer in a sample, comprising contacting said sample
with a nucleic acid molecule which hybridizes to all or part of the molecule encoded by SEQ ID
NO: 1, 2, 3, 4, 8, 15, 19, 22 or 26 and determining hybridization as an indication of cancer cells
in said sample.

41. A method for screening for cancer in a sample, comprising contacting said sample
with the isolated antibody of claim 38, and determining binding of said antibody to a target as
an indicator of cancer.

42. Method for diagnosing a cancerous condition in a subject, comprising contacting
an immune reactive cell containing sample of said subject to a cell line transfected with the
isolated nucleic acid molecule of claim 1, and determining interaction of said transfected cell line
with said immunoreactive cell, said interaction being indicative of said cancer condition.

43. A method for determining regression, progression or onset of a cancerous
condition comprising monitoring a sample from a patient with said cancerous condition for a
parameter selected from the group consisting of (i) a protein encoded by SEQ ID NO: 1, 2, 3, 4,
8, 15, 19, 22 or 26, (ii) a peptide derived from said protein, (iii) cytolytic T cells specific for said
peptide and an MHC molecule with which it non-covalently complexes, and (iv) antibodies
specific for said CT protein, wherein amount of said parameter is indicative of progression or
regression or onset of said cancerous condition.

44. The method of claim 43, wherein said sample is a body fluid or exudate. 

45. The method of claim 43, wherein said sample is a tissue.

46. The method of claim 43, comprising contacting said sample with an antibody
which specifically binds with said protein or peptide.

47. The method of claim 46, wherein said antibody is labelled with a radioactive label
or an enzyme.

48. The method of claim 46, wherein said antibody is a monoclonal antibody.

49. The method of claim 43, comprising amplifying RNA which codes for said
protein.

50. The method of claim 49, wherein said amplifying comprises carrying out
polymerase chain reaction.

51. The method of claim 42, comprising contacting said sample with a nucleic acid
molecule which specifically hybridizes to a nucleic acid molecule which codes for or expresses
said protein.

52. The method of claim 49, wherein said nucleic acid molecule comprises SEQ ID 
NO: 9, 10, 11, 12, 13, 14, 17, 18, 20, 21, 24, 25, 28 or 29. 

53. The method of claim 43, comprising assaying said sample for shed protein. 

54. The method of claim 43, comprising assaying said sample for antibodies specific
for said protein, by contacting said sample with protein.

55. Method for diagnosing a cancerous condition comprising assaying a sample taken
from a subject for an immunoreactive cell specific for a peptide derived from a protein encoded
by SEQ ID NO: 1, 2, 3, 4, 8, 15, 19, 22 or 26, complexed to an MHC molecule, presence of said
immunoreactive cell being indicative of said cancerous condition.

56. Composition comprising at least one peptide consisting of an amino acid sequence
of from 8 to 25 amino acids concatenated to each other in the isolated cancer associated antigen
of claim 12, and a pharmaceutically acceptable adjuvant.

57. The composition of claim 56, wherein said adjuvant is a saponin, a cytokine, or
GM-CSF.

58. The composition of claim 56, comprising a plurality of MHC binding peptides.

59. Composition comprising an expression vector which encodes at least one peptide
consisting of an amino acid sequence of from 8 to 25 amino acids concatenated to each other in
the isolated cancer associated antigen of claim 12, and pharmaceutically acceptable adjuvant.

60. The composition of claim 59, wherein said expression vector encodes a plurality
of peptides.

61. A method for screening for possible presence of a pathological condition,
comprising assaying a sample from a patient believed to have a pathological condition for
antibodies specific to at least one of the cancer associated antigens encoded by SEQ ID NOS: 1,
2, 3, 4, 8, 15, 19, 22 or 26, presence of said antibodies being indicative of possible presence of
said pathological condition.

62. The method of claim 61, wherein said pathological condition is cancer.

63. The method of claim 61, wherein said cancer is melanoma.

64. The method of claim 61, further comprising contacting said sample to purified
cancer associated antigen encoded by SEQ ID NO: 1, 3, 4, 8, 15, 19, 22 or 26.

65. A method for screening for possible presence of a pathological condition in a 
subject, comprising assaying a sample taken from said subject for expression of a nucleic acid 
molecule, the nucleotide sequence of which comprises SEQ ID NO: 1, 2, 3, 4, 8, 15, 19, 22 or 
26, expression of said nucleic acid molecule being indicative of possible presence of said 
pathological condition. 

66. The method of claim 65, wherein said pathological condition is canc^. 

67. Themethod of claim 65, comprising determining e7q)ression via polymerase chain 
reaction. 

68. The mebod of claim 65, comprising determining CTqpression by contacting said 
sample with at least one of SEQ ID NO: 9, 10, 11, 12, 13, 14, 17, 18, 20, 21, 24, 25, 28 or 29. 
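
Claims 67 and 68 rely on oligonucleotides that anneal to the target nucleic acid. The sketch below shows, under stated assumptions, how one could check where a primer matches a template: the forward primer is searched as written, the reverse primer as its reverse complement. The two template strings are single 60-nucleotide rows copied from SEQ ID NO: 1 (the rows ending at positions 480 and 960), `forward_3prime` is the last 20 nucleotides of SEQ ID NO: 9, and the helper names are hypothetical. Exact string matching is a simplification and is not a model of hybridization.

```python
COMPLEMENT = str.maketrans("acgtn", "tgcan")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (lower case)."""
    return seq.lower().translate(COMPLEMENT)[::-1]


def find_primer_site(template, primer, reverse=False):
    """Return the 1-based position of the first exact match of the primer
    (or of its reverse complement when reverse=True), or None."""
    target = reverse_complement(primer) if reverse else primer.lower()
    index = template.lower().find(target)
    return None if index < 0 else index + 1


# Rows of SEQ ID NO: 1 ending at positions 480 and 960 (excerpts only).
row_480 = "ccgagtcgccggggacctccggggtgaaccatgttgagtcctgccaacggggagcagctc"
row_960 = "cggcggcagcgcaacaacgagaaccgtgagaacgcgtccagcaaccacgaccacgacgac"

forward_3prime = "atgttgagtcctgccaacgg"      # 3' portion of SEQ ID NO: 9
reverse_primer = "cgtggtcgtggttgctggacgcg"   # SEQ ID NO: 10

print(find_primer_site(row_480, forward_3prime))
print(find_primer_site(row_960, reverse_primer, reverse=True))
```

Positions are relative to the start of each excerpt, not to the full sequence.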

69. A method for determining regression, progression or onset of a cancerous
condition comprising monitoring a sample from a patient with said cancerous condition for a
parameter selected from the group consisting of (i) a cancer associated antigen encoded by SEQ
ID NO: 1, 2, 3, 4, 8, 15, 19, 22 or 25, (ii) a peptide derived from said cancer associated antigen,
(iii) cytolytic T cells specific for said peptide and an MHC molecule with which it
non-covalently complexes, and (iv) antibodies specific for said cancer associated antigen, wherein
amount of said parameter is indicative of progression or regression or onset of said cancerous 
condition. 

70. The method of claim 69, wherein said sample is a body fluid or exudate.

71. The method of claim 69, wherein said sample is a tissue.

72. The method of claim 69, comprising contacting said sample with an antibody 
which specifically binds with said protein or peptide. 

73. The method of claim 72, wherein said antibody is labelled with a radioactive label
or an enzyme. 

74. The method of claim 72, wherein said antibody is a monoclonal antibody. 

75. The method of claim 69, comprising amplifying RNA which codes for said 
protein. 

76. The method of claim 75, wherein said amplifying comprises carrying out 
polymerase chain reaction. 

77. The method of claim 69, comprising contacting said sample with a nucleic acid
molecule which specifically hybridizes to a nucleic acid molecule which codes for or expresses 
said protein. 

78. The method of claim 69, comprising assaying said sample for shed cancer 
associated antigen. 

79. The method of claim 69, comprising assaying said sample for antibodies specific 
for said cancer associated antigen, by contacting said sample with said cancer associated antigen. 

80. Method for screening for a cancerous condition comprising assaying a sample
taken from a subject for an immunoreactive cell specific for a peptide derived from a cancer
associated antigen encoded by SEQ ID NO: 1, 2, 3, 4, 8, 15, 19, 22 or 26, complexed to an MHC
molecule, presence of said immunoreactive cell being indicative of said cancerous condition.

81. An isolated nucleic acid molecule consisting of a nucleotide sequence defined by
SEQ ID NO: 1, 2, 3, 8, 15, 19, 22 or 26. 

82. Isolated nucleic acid molecule the complementary sequence of which hybridizes,
under stringent conditions, to the nucleotide sequence set forth in SEQ ID NO: 4, 5, 8, 15, 19, 
22 or 26. 

83. An isolated polypeptide comprising at least 9 consecutive amino acids set forth 
in SEQ ID NO: 5, 7, 16, 19, 23, 27, or 30. 

84. The isolated polypeptide of claim 83, comprising at least 9 consecutive amino 
acids set forth in SEQ ID NO: 23 or 30. 

85. The isolated polypeptide of claim 84, comprising at least 9 consecutive amino acids
of the amino acid sequence set forth in SEQ ID NO: 23.

86. The isolated polypeptide of claim 85, comprising amino acids 102-111, 904-912
or 1262-1270 of SEQ ID NO: 23. 
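
Claim 86 identifies peptides by residue numbers that are 1-based and inclusive. As a hedged illustration, the sketch below extracts such a range from a stretch of sequence whose first residue number is known. The helper `residues` is hypothetical. The input is the row of SEQ ID NO: 23 carrying the position markers 900, 905 and 910, written in one-letter code; that the row spans residues 897 to 912 is inferred from those markers, and the OCR reading "He" is taken as Ile.

```python
def residues(seq, first, last, offset=1):
    """Return residues first..last (1-based, inclusive) from seq, where the
    first character of seq carries residue number `offset`."""
    if first < offset or last > offset + len(seq) - 1 or first > last:
        raise ValueError("range outside the supplied sequence")
    return seq[first - offset:last - offset + 1]


# Residues 897-912 of SEQ ID NO: 23 as read from the listing (assumption:
# row start inferred from the 900/905/910 position markers).
row_897_912 = "GKLEDSTSLSKILDTV"

peptide = residues(row_897_912, 904, 912, offset=897)
print(peptide, len(peptide))
```

The same call with `offset=1` applies when the full-length sequence is available as a single string.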

87. An isolated nucleic acid molecule which encodes the amino acid sequence of SEQ
ID NO: 30.

88. An isolated nucleic acid molecule which encodes the isolated polypeptide of claim
86.

89. Expression vector comprising the isolated nucleic acid molecule of claim 88,
operably linked to a promoter.

<210> 1
<211> 1533 
<212> DNA 
<213> Homo sapiens 
<220> 
<221> CDS 
<222> 235 
<400> 1 



ggttttccac gttggacaag tgcggctcgg cggccagcgg agcgcgcccc ttcccgctgc 60
ccgctccgct cctctcttct acccagccca gtgggcgagt gggcagcggc ggccgcggcg 120
ctgggccctc tcccgccggt gtgtgcgcgc tcgtacgcgc ggcccccggc gccagccccg 180
ccgcctgaga gggggcctgc gccgccggcc ggggcgtgcg cccgggagcc accgncaccg 240
cggcccgcgc cctcaggcgc tggggtcccc gcggacccgg aggcggcgga cgggctcggc 300
agatgtagcc gccgggccga agcaggagcc ggcggggggg cgccgggaga gcgagggctt 360
tgcattttgc agtgctattt tttgaggggg gcggagggtg gaggaagtcg gaaagccgcg 420
ccgagtcgcc ggggacctcc ggggtgaacc atgttgagtc ctgccaacgg ggagcagctc 480
cacctggtga actatgtgga ggactacctg gactccatcg agtccctgcc tttcgacttg 540
cagagaaatg tctcgctgat gcgggagatc gacgcgaaat accaagagat cctgaaggag 600
ctagacgagt gctacgagcg cttcagtcgc gagacagacg gggcgcagaa gcggcggatg 660
ctgcactgtg tgcagcgcgc gctgatccgc agccaggagc tgggcgacga gaagatccag 720
atcgtgagcc agatggtgga gctggtggag aaccgcacgc ggcaggtgga cagccacgtg 780
gagctgttcg aggcgcagca ggagctgggc gacacagcgg gcaacagcgg caaggctggc 840
gcggacaggc ccaaaggcga ggcggcagcg caggctgaca agcccaacag caagcgctca 900
cggcggcagc gcaacaacga gaaccgtgag aacgcgtcca gcaaccacga ccacgacgac 960
ggcgcctcgg gcacacccaa ggagaagaag gccaagacct ccaagaagaa gaagcgctcc 1020
aaggccaagg cggagcgaga ggcgtcccct gccgacctcc ccatcgaccc caacgaaccc 1080
acgtactgtc tgtgcaacca ggtctcctat
tgccccatcg agtggttcca cttctcgtgc
tggtactgtc ccaagtgccg gggggagaac
tccaaaaaag agagggctta caacaggtag
caaaataaac cgtgtattta ttacattgct
gtatattttt aaagaatgtt agaaaaggaa
tgtttgcctt ttgttttcat tggtacacgt
tttagaaact acaaatatag gtttgattca



<210> 2
<211> 1143
<212> DNA
<213> Homo sapiens
<400> 2






gagtaacccg ataatatgcc gttgtccggc
agcagtgatc ccgggcctgt ggctcggggc
cccgcggggg ctcggagaca gtttcaggcc
gcgtggccgt ggaaacagat cctgaaggag
gagacagacg gggcgcagaa gcggcggatg
agccaggagc tgggcgacga gaagatccag
aaccgcacgc ggcaggtgga cagccacgtg
gacacagtgg gcaacagcgg caaggttggc
cagtctgaca agcccaacag caagcgctca
aacgcgtcca gcaaccacga ccacgacgac
gccaagacct ccaagaagaa gaagcgctcc
gccgacctcc ccatcgaccc caacgaaccc
ggggagatga tcggctgcga caacgacgag
gtggggctca atcataaacc caagggcaag
gagaagacca tggacaaagc cctggagaaa
tttgtggaca ggcgcctggt gtgaggagga
gcctttgttg aggtgcaagg agtgtaaaat
ccattccttt catagggatg gcagtgattc
gtaacaagaa agtggtctgt ggatcagcat
aca












<210> 3
<211> 742
<212> DNA
<213> Homo sapiens
<400> 3






cgccgtccac accccagcgg ccctgacgct
agtgacaggc aaggccacgc ccccgcgagg



ggggagatga tcggctgcga caacgacgag 1140 
gtggggctca atcataaacc caagggcaag 1200 
gagaagacca tggacaaagc cctggagaaa 1260 
tttgtggaca ggcgcctggt gtgaggagga 1320 
gcctttgttg aggtgcaagg agtgtaaaat 1380 
ccattccttt catagggatg gcagtgattc 1440 
gtaacaagaa agtggtctgt ggatcagcat 1500 
aca 1533 



acggcgacga gaattcccag atatagcagt 60 

cggggctgca gttcggaccg cctcccgcga 120 

gcatctttgc tgacccgagg gtggggccgc 180 

ctagacgagt gctacgagcg cttcagtcgc 240 

ctgcactgtg tgcagcgcgc gctgatccgc 300 

atcgtgagcc agatggtgga gctggtggag 360 

gagctgttcg aggcgcagca ggagctgggc 420 

gcggacaggc ccaatggcga tgcggtagcg 480 

cggcggcagc gcaacaacga gaaccgtgag 540 

ggcgcctcgg gcacacccaa ggagaagaag 600 

aaggccaagg cggagcgaga ggcgtcccct 660 

acgtactgtc tgtgcaacca ggtctcctat 720 

tgccccatcg agtggttcca cttctcgtgc 780 

tggtactgtc ccaagtgccg gggggagaac 840 

tccaaaaaag agagggctta caacaggtag 900 

caaaataaac cgtgtattta ttacattgct 960 

gtatattttt aaagaatgtt agaaaaggaa 1020 

tgtttgcctt ttgttttcat tggtacacgt 1080 

tttagaaact acaaatatag gtttgattca 1140 

1143 



gtcccctccg cgaccctcgc ctctggaaaa 60 
gccggcctcg agcccgcagc ccccagggcc 120 
tgggacgaga tcctgaagga gctagacgag tgctacgagc gcttcagtcg cgagacagac 180
ggggcgcaga agcggcggat gctgcactgt gtgcagcgcg cgctgatccg cagccaggag 240
ctgggcgacg agaagatcca gatcgtgagc cagatggtgg agctggtgga gaaccgcacg 300
cggcaggtgg acagccacgt ggagctgttc gaggcgcagc aggagctggg cgacacagcg 360
ggcaacagcg gcaaggctgg cgcggacagg cccaaaggcg aggcggcagc gcaggctgac 420
aagcccaaca gcaagcgctc acggcggcag cgcaacaacg agaaccgtga gaacgcgtcc 480
agcaaccacg accacgacga cggcgcctcg ggcacaccca aggagaagaa ggccaagacc 540
tccaagaaga agaagcgctc caaggccaag gcggagcgag aggcgtcccc tgccgacctc 600
cccatcgacc ccaacgaacc cacgtactgt ctgtgcaacc aggtctccta tggggagatg 660
atcggctgcg acaacgacga gtgccccatc gagtggttcc acttctcgtg cgtggggctc 720
aatcataaac ccaagggcaa gt 742


<210> 4
<211> 857
<212> DNA
<213> Homo sapiens
<400> 4










cct cccracraa 


cggtgtccat 


aacacagggc gggaagagat aaggcctagg 


gaaggcgccc 


60 


ctzcgggccrta 


tccacctctt: 


ctggggctcg gcactaggaa gcagcttccc 


tctcaggccc 


120 


ct^'b'tg'tcticc 


aagccgttcc 


aaactgagta ccgggagacg acacaaaggg 


agggcggtga 


180 


caaAiiaacoc 


aaacorcoaQa 


accacctagg ctgctgggag tggiiggtccg 


gccgcggaat 


240 


rrrrA fTfti"r'f*+*Cf 




acgagbgcta cgagcgcttc agtcgcgaga 


cagacggggc 


300 




cggatgctgc 


actgtgtgca gcgcgcgctg atccgcagcc 


aggagctggg 


360 


KXi^sj ay oay 


a'tccacra.iiccf 


taaaccaaat aa'tGraaac'tCT gtggagaacc 


gcacgcggca 


420 


y y ia^^tj at-*ay w 


caccrtacraoc 


tot^cgaoac ocagcaggag ctgggcgaca 


cagcgggcaa 


480 


(■vCiy ^y y ^aay 


cTctcrciccfcaa 

y ii»y y ^y **y tS 


acaaacccaa aggcgaggcg gcagcgcagg 


ctgacaagcc 


540 


caa cagcaag 


cgct cacggc 


ggcagcgcaa caacgagaac cgtgagaacg 


cgtccagcaa 


600 


ccacgaccac 


gacgacggcg 


cctcgggcac acccaaggag aagaaggcca 


agaCCuCCaa 




gaagaagaag 


cgctccaagg 


ccaaggcgga gcgagaggcg tcccctgccg 


acctccccat 


720 


cgaccccaac 


gaacccacgt 


actgtctgtg caaccaggtc tcctatgggg 


agatgatcgg 


780 


ctgcgacaac 


gacgagtgcc 


ccatcgagtg gttccacttc tcgtgcgtgg 


ggctcaatca 


840 


taaacccaag 


ggcaagt 






857 



<210> 5 
<211> 279 
<212> PRT 

<213> Homo sapiens
<400> 5

Met Leu Ser Pro Ala Asn Gly Glu Gln Leu His Leu Val Asn Tyr Val
1 5 10 15

Glu Asp Tyr Leu Asp Ser Ile Glu Ser Leu Pro Phe Asp Leu Gln Arg
20 25 30

Asn Val Ser Leu Met Arg Glu Ile Asp Ala Lys Tyr Gln Glu Ile Leu
35 40 45

Lys Glu Leu Asp Glu Cys Tyr Glu Arg Phe Ser Arg Glu Thr Asp Gly
50 55 60

Ala Gln Lys Arg Arg Met Leu His Cys Val Gln Arg Ala Leu Ile Arg
65 70 75 80

Ser Gln Glu Leu Gly Asp Glu Lys Ile Gln Ile Val Ser Gln Met Val
85 90 95

Glu Leu Val Glu Asn Arg Thr Arg Gln Val Asp Ser His Val Glu Leu
100 105 110

Phe Glu Ala Gln Gln Glu Leu Gly Asp Thr Val Gly Asn Ser Gly Lys
115 120 125

Val Gly Ala Asp Arg Pro Asn Gly Asp Ala Val Ala Gln Ser Asp Lys
130 135 140

Pro Asn Ser Lys Arg Ser Arg Arg Gln Arg Asn Asn Glu Asn Arg Glu
145 150 155 160

Asn Ala Ser Ser Asn His Asp His Asp Asp Gly Ala Ser Gly Thr Pro
165 170 175

Lys Glu Lys Lys Ala Lys Thr Ser Lys Lys Lys Lys Arg Ser Lys Ala
180 185 190

Lys Ala Glu Arg Glu Ala Ser Pro Ala Asp Leu Pro Ile Asp Pro Asn
195 200 205

Glu Pro Thr Tyr Cys Leu Cys Asn Gln Val Ser Tyr Gly Glu Met Ile
210 215 220

Gly Cys Asp Asn Asp Glu Cys Pro Ile Glu Trp Phe His Phe Ser Cys
225 230 235 240

Val Gly Leu Asn His Lys Pro Lys Gly Lys Trp Tyr Cys Pro Lys Cys
245 250 255

Arg Gly Glu Asn Glu Lys Thr Met Asp Lys Ala Leu Glu Lys Ser Lys
260 265 270

Lys Glu Arg Ala Tyr Asn Arg
275 
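
The protein entries in this listing use three-letter residue codes. For downstream work it can be convenient to convert a row to one-letter code. The following is a minimal sketch with a hypothetical helper, `to_one_letter`; the input is the first row of SEQ ID NO: 5 with glutamine written as Gln. Unknown codes raise an error rather than being guessed, which also flags OCR misreadings.

```python
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def to_one_letter(listing_row):
    """Convert a whitespace-separated row of three-letter codes to a
    one-letter string; raise ValueError on any unrecognized code."""
    letters = []
    for code in listing_row.split():
        if code not in THREE_TO_ONE:
            raise ValueError(f"unrecognized residue code: {code!r}")
        letters.append(THREE_TO_ONE[code])
    return "".join(letters)


first_row = "Met Leu Ser Pro Ala Asn Gly Glu Gln Leu His Leu Val Asn Tyr Val"
print(to_one_letter(first_row))
```

Position-marker rows (the lines of numbers under each residue row) would need to be filtered out before calling the helper.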

<210> 6 
<211> 210 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Met Leu His Cys Val Gln Arg Ala Leu Ile Arg Ser Gln Glu Leu Gly
1 5 10 15

Asp Glu Lys Ile Gln Ile Val Ser Gln Met Val Glu Leu Val Glu Asn
20 25 30

Arg Thr Arg Gln Val Asp Ser His Val Glu Leu Phe Glu Ala Gln Gln
35 40 45

Glu Leu Gly Asp Thr Val Gly Asn Ser Gly Lys Val Gly Ala Asp Arg
50 55 60

Pro Asn Gly Asp Ala Val Ala Gln Ser Asp Lys Pro Asn Ser Lys Arg
65 70 75 80

Ser Arg Arg Gln Arg Asn Asn Glu Asn Arg Glu Asn Ala Ser Ser Asn
85 90 95

His Asp His Asp Asp Gly Ala Ser Gly Thr Pro Lys Glu Lys Lys Ala
100 105 110

Lys Thr Ser Lys Lys Lys Lys Arg Ser Lys Ala Lys Ala Glu Arg Glu
115 120 125

Ala Ser Pro Ala Asp Leu Pro Ile Asp Pro Asn Glu Pro Thr Tyr Cys
130 135 140

Leu Cys Asn Gln Val Ser Tyr Gly Glu Met Ile Gly Cys Asp Asn Asp
145 150 155 160

Glu Cys Pro Ile Glu Trp Phe His Phe Ser Cys Val Gly Leu Asn His
165 170 175

Lys Pro Lys Gly Lys Trp Tyr Cys Pro Lys Cys Arg Gly Glu Asn Glu
180 185 190

Lys Thr Met Asp Lys Ala Leu Glu Lys Ser Lys Lys Glu Arg Ala Tyr
195 200 205

Asn Arg
210

<210> 7 
<211> 235 
<212> PRT 

<213> Homo sapiens 
<400> 7 

Met Glu Ile Leu Lys Glu Leu Asp Glu Cys Tyr Glu Arg Phe Ser Arg
1 5 10 15

Glu Thr Asp Gly Ala Gln Lys Arg Arg Met Leu His Cys Val Gln Arg
20 25 30

Ala Leu Ile Arg Ser Gln Glu Leu Gly Asp Glu Lys Ile Gln Ile Val
35 40 45

Ser Gln Met Val Glu Leu Val Glu Asn Arg Thr Arg Gln Val Asp Ser
50 55 60

His Val Glu Leu Phe Glu Ala Gln Gln Glu Leu Gly Asp Thr Val Gly
65 70 75 80

Asn Ser Gly Lys Val Gly Ala Asp Arg Pro Asn Gly Asp Ala Val Ala
85 90 95

Gln Ser Asp Lys Pro Asn Ser Lys Arg Ser Arg Arg Gln Arg Asn Asn
100 105 110

Glu Asn Arg Glu Asn Ala Ser Ser Asn His Asp His Asp Asp Gly Ala
115 120 125

Ser Gly Thr Pro Lys Glu Lys Lys Ala Lys Thr Ser Lys Lys Lys Lys
130 135 140

Arg Ser Lys Ala Lys Ala Glu Arg Glu Ala Ser Pro Ala Asp Leu Pro
145 150 155 160

Ile Asp Pro Asn Glu Pro Thr Tyr Cys Leu Cys Asn Gln Val Ser Tyr
165 170 175

Gly Glu Met Ile Gly Cys Asp Asn Asp Glu Cys Pro Ile Glu Trp Phe
180 185 190

His Phe Ser Cys Val Gly Leu Asn His Lys Pro Lys Gly Lys Trp Tyr
195 200 205

Cys Pro Lys Cys Arg Gly Glu Asn Glu Lys Thr Met Asp Lys Ala Leu
210 215 220

Glu Lys Ser Lys Lys Glu Arg Ala Tyr Asn Arg
225 230 235

<210> 8
<211> 772
<212> DNA
<213> Homo sapiens
<221> CDS
<222> 695,714
<400> 8

aaagcgttct cggcggcagc gcaacaacta gaaccgtgag aacgcgtcca gcaaccgcga 60
cccacgacga cgtcacctcg ggcacgccca aggagaagaa agcccagacc tctaagaaga 120
agcagggctc catggccaag gcgtagcggc aggcgtcccc cgcagacctc cccatcgacc 180
ccagcgagcc ctcctactgg gagatgatcc gctgcgacaa cgaatgcccc atcgagtggt 240
tccgcttctc gtgtgtgagt ctcaaccata aaccaaagcg caagtggtac tgttccagat 300
gccggggaaa gaacgatggg caaagccctt gagaagtcca gaaaaaaaac agggcttata 360
acaggtagtt tggggacatg cgtctaatag tgaggagaac aaaataagcc agtgtgttga 420
ttacattgcc acctttgctg aggtgcagga agtgtaaaat gtatattttt aaagaatgtt 480
gttagaggcc gggcgcggtg gctcacgcct gtaatcccag cactttggga ggccgaggcg 540
gtcggatcac gaggtcagga gatcgagacc atcctggcta acacggtgaa accccgtctc 600
tactaaaaat tcaaaaaaaa aattagctgg gcgtggtggc gggcgcctgt agtcccagct 660
attcgggagg ctgaggcagg agaatggcnt gaacctggga ggtggagctt gcantgagcc 720
aaggtcgcgc cactgcactc cagcctgggc gacagagcga gactccatct ta 772



<210> 9 

<211> 32 

<212> DNA 

<213> Homo sapiens 

<400> 9 

cacacaggat ccatgttgag tcctgccaac gg 32



<210> 10 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 10 

cgtggtcgtg gttgctggac gcg 23 
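
Each nucleotide row in this listing consists of blocks of up to ten bases followed by a running position counter, and the `<211>` field states the total length. A hedged sketch of a consistency check follows; `parse_row` and `check_listing` are hypothetical helper names, and the example uses the single row of SEQ ID NO: 10 shown above with its declared length of 23.

```python
def parse_row(row):
    """Split 'block block ... N' into (sequence, end_position)."""
    *blocks, counter = row.split()
    return "".join(blocks), int(counter)


def check_listing(rows, declared_length):
    """Return True when every row's counter equals the cumulative length
    and the final total equals the declared <211> length."""
    total = 0
    for row in rows:
        sequence, end_position = parse_row(row)
        total += len(sequence)
        if total != end_position:
            return False
    return total == declared_length


seq_10_rows = ["cgtggtcgtg gttgctggac gcg 23"]
sequence, end_position = parse_row(seq_10_rows[0])
print(sequence, end_position, check_listing(seq_10_rows, 23))
```

A check of this kind is useful for spotting rows in which extraction has dropped or duplicated characters.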



<210> 11 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 11 

cccagcggcc ctgacgctgt c 21 



<210> 12 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 12 

cgtggtcgtg gttgctggac gcg 23 



<210> 13 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 13 

ggaagagata aggcctaggg aag 23

<210> 14 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 14 

cgtggtcgtg gttgctggac gcg 23 

<210> 15 

<211> 2030 

<212> DNA 

<213> Homo sapiens 

<221> CDS 

<222> 1628, 1752, 1758, 1769, 1789, 1873, 1908, 1915, 1933, 1970, 1976, 2022 
<400> 15 

ctcgtgccgt taaagatggt cttctgaagg ctaactgcgg aatgaaagtt tctattccaa 60 

ctaaagcctt agaattgatg gacatgcaaa ctttcaaagc agagcctccc gagaagccat 120

ctgccttcga gcctgccatt gaaatgcaaa agtctgttcc aaataaagcc ttggaattga 180 

agaatgaaca aacattgaga gcagatgaga tactcccatc agaatccaaa caaaaggact 240 

atgaagaaag ttcttgggat tctgagagtc tctgtgagac tgtttcacag aaggatgtgt 300 

gtttacccaa ggctacacat caaaaagaaa tagataaaat aaatggaaaa ttagaagagt 360 

ctcctgataa tgatggtttt ctgaaggctc cctgcagaat gaaagtttct attccaacta 420 

aagccttaga attgatggac atgcaaactt tcaaagcaga gcctcccgag aagccatctg 480 

ccttcgagcc tgccattgaa atgcaaaagt ctgttccaaa taaagccttg gaattgaaga 540 

atgaacaaac attgagagca gatcagatgt tcccttcaga atcaaaacaa aagaaggttg 600 

aagaaaattc ttgggattct gagagtctcc gtgagactgt ttcacagaag gatgtgtgtg 660 

tacccaaggc tacacatcaa aaagaaatgg ataaaataag tggaaaatta gaagattcaa 720 

ctagcctatc aaaaatcttg gatacagttc attcttgtga aagagcaagg gaacttcaaa 780 

aagatcactg tgaacaacgt acaggaaaaa tggaacaaat gaaaaagaag ttttgtgtac 840 

tgaaaaagaa actgtcagaa gcaaaagaaa taaaatcaca gttagagaac caaaaagtta 900 

aatgggaaca agagctctgc agtgtgagat tgactttaaa ccaagaagaa gagaagagaa 960 

gaaatgccga tatattaaat gaaaaaatta gggaagaatt aggaagaatc gaagagcagc 1020 

ataggaaaga gttagaagtg aaacaacaac ttgaacaggc tctcagaata caagatatag 1080

aattgaagag tgtagaaagt aatttgaatc aggtttctca cactcatgaa aatgaaaatt 1140 

atctcttaca tgaaaattgc atgttgaaaa aggaaattgc catgctaaaa ctggaaatag 1200 

ccacactgaa acaccaatac caggaaaagg aaaataaata ctttgaggac attaagattt 1260 

taaaagaaaa gaatgctgaa cttcagatga ccctaaaact gaaagaggaa tcattaacta 1320 

aaagggcatc tcaatatagt gggcagctta aagttctgat agctgagaac acaatgctca 1380 

cttctaaatt gaaggaaaaa caagacaaag aaatactaga ggcagaaatt gaatcacacc 1440 

atcctagact ggcttctgct gtacaagacc atgatcaaat tgtgacatca agaaaaagtc 1500 

aagaacctgc tttccacatt gcaggagatg cttgtttgca aagaaaaatg aatgttgatg 1560 

tgagtagtac cgatatataa caatgaggtg ctccatcaac cactttctga agctcaaagg 1620 
aaatccanaa gcctaaaaat taatctcaat tatgcaggag atgctctaag agaaaataca 1680
ttggtttcag gaacatgcac aaagagacca acgtgaaaca cagtgtcaaa tgaaggaagc 1740 
tgaacacatg tntcaaancg aacaagatna tgtgaacaaa cacactganc agcaggagtc 1800
tctagatcag aaattatttc aactacaaag caaaaatatg tggcttcaac agcaattagt 1860 
tcatgcacat aangaaagct gacaacaaaa gcaagataac aattgatntt cattntcttg 1920 
agaggaaaat gcncatcatc ttctaaaaga gaaaaatgag gagatatttn attacnataa 1980 
ccatttaaaa aacccgtata tttcaatatg gaaaaaaaaa anaaaaaaaa 2030 

<210> 16 
<211> 513 
<212> PRT 

<213> Homo sapiens 
<400> 16 

Met Lys Val Ser lie Pro Thr Lys Ala Leu Glu Leu Met Asp Met Gin 
1.5 10 15 

Thr Phe Lys Ala Glu Pro Pro Glu Lys Pro Ser Ala Phe Glu Pro Ala 
20 25 30 

lie Glu Met Gin Lys Ser Val Pro Asn Lys Ala Leu Glu Leu Lys Asn 
35 40 45 

Glu Gin Thr Leu Arg Ala Asp Glu lie Leu Pro Ser Glu Ser Lys Gin 
50 55 60 

Lys Asp Tyr Glu Glu Ser Ser Trp Asp Ser Glu Ser Leu Cys Glu Thr 
65 70 75 80 

Val Ser Gin Lys Asp Val Cys Leu Pro Lys Ala Thr His Gin Lys Glu 
85 90 95 

Xle Asp Lys lie Asn Gly Lys Leu Glu Glu Ser Pro Asp Asn Asp Gly 
100 105 110 

Phe Leu Lys Ala Pro Cys Arg Met Lys Val Ser Xle Pro Thr Lys Ala 
115 120 125 

Leu Glu Leu Met Asp Met Gin Thr Phe Lys Ala Glu Pro Pro Glu Lys 
130 135 140 

Pro Ser Ala Phe Glu Pro Ala He Glu Met Gin Lys Ser Val Pro Asn 
145 150 155 160 

Lys Ala Leu Glu Leu Lys Asn Glu Gin Thr Leu Arg Ala Asp Gin Met 
165 170 175 

Phe Pro Ser Glu Ser Lys Gin Lys Lys Val Glu Glu Asn Ser Trp Asp 
180 185 190 

Ser Glu Ser Leu Arg Glu Thr Val Ser Gin Lys Asp Val Cys Val Pro 
195 200 205 

Lys Ala Thr His Gin Lys Glu Met Asp Lys lie Ser Gly Lys Leu Glu 

210 215 220 

Asp Ser Thr Ser Leu Ser Lys He Leu Asp Thr Val His Ser Cys Glu 
225 230 235 240 

Arg Ala Arg Glu Leu Gin Lys Asp His Cys Glu Gin Arg Thr Gly Lys 

245 250 255 

Met Glu Gin Met Lys Lys Lys Phe Cys Val Leu Lys Lys Lys Leu Ser 
260 265 270 

Glu Ala Lys Glu He Lys Ser Gin Leu Glu Asn Gin Lys Val Lys Trp 
275 280 285

Glu Gln Glu Leu Cys Ser Val Arg Leu Thr Leu Asn Gln Glu Glu Glu
290 295 300 

Lys Arg Arg Asn Ala Asp He Leu Asn Glu Lys He Arg Glu Glu Leu 
305 310 315 320 

Gly Arg He Glu Glu Gin His Arg Lys Glu Leu Glu Val Lys Gin Gin 

325 330 335 

Leu Glu Gin Ala Leu Arg He Gin Asp He Glu Leu Lys Ser Val Glu 
340 345 350 

Ser Asn Leu Asn Gin Val Ser His Thr His Glu Asn Glu Asn Tyr Leu 
355 360 365

Leu His Glu Asn Cys Met Leu Lys Lys Glu He Ala Met Leu Lys Leu 
370 375 380 

Glu He Ala Thr Leu Lys His Gin Tyr Gin Glu Lys Glu Asn Lys Tyr 
385 390 395 400 

Phe Glu Asp He Lys He Leu Lys Glu Lys Asn Ala Glu Leu Gin Met 
405 410 415 

Thr Leu Lys Leu Lys Glu Glu Ser Leu Thr Lys Arg Ala Ser Gin Tyr 

420 425 430 

Ser Gly Gin Leu Lys Val Leu He Ala Glu Asn Thr Met Leu Thr Ser 
435 440 445 



Lys Leu Lys Glu Lys Gin Asp Lys Glu He Leu Glu Ala Glu He Glu 
450 455 460 

Ser His His Pro Arg Leu Ala Ser Ala Val Gin Asp His Asp Gin He 
465 470 475 480 

Val Thr Ser Arg Lys Ser Gin Glu Pro Ala Phe His He Ala Gly Asp 
485 490 495 

Ala Cys Leu Gin Arg Lys Met Asn Val Asp Val Ser Ser Thr Asp He 
500 505 510 



<210> 17 

<211> 33 

<212> DNA 

<213> Homo sapiens 

<400> 17 

cacacaggat ccatgcaggc cccgcacaag gag 33 



<210> 18 

<211> 34 

<212> DNA 

<213> Homo sapiens 

<400> 18 

cacacaaagc ttctaggatt tggcacagcc agag 34 



<210> 19 
<211> 294 
<212> PRT 

<213> Homo sapiens 
<400> 19 

Met Pro Leu Cys Thr Ala Thr Arg He Pro Arg Tyr Ser Ser Ser Ser 

15 10 15 

Asp Pro Gly Pro Val Ala Arg Gly Arg Gly Cys Ser Ser Asp Arg Leu 
20 25 30 

Pro Arg Pro Ala Gly Pro Ala Arg Arg Gln Phe Gln Ala Ala Ser Leu
35 40 45 

Leu Thr Arg Gly Trp Gly Arg Ala Trp Pro Trp Lys Gin Xle Leu Lys 
50 55 60 

Glu Leu Asp Glu Cys Tyr Glu Arg Phe Ser Arg Glu Thr Asp Gly Ala 
65 70 75 80 

Gin Lys Arg Arg Met Leu His Cys Val Gin Arg Ala Leu lie Arg Ser 
85 * 90 95 

Gin Glu Leu Gly Asp Glu Lys lie Gin He Val Ser Gin Met Val Glu 
100 105 110 

Leu Val Glu Asn Arg Thr Arg Gin Val Asp Ser His Val Glu Leu Phe 
115 120 125 

Glu Ala Gin Gin Glu Leu Gly Asp Thr Val Gly Asn Ser Gly Lys Val 

130 135 140 

Gly Ala Asp Arg Pro Asn Gly Asp Ala Val Ala Gin Ser Asp Lys Pro 
145 150 155 160 

Asn Ser Lys Arg Ser Arg Arg Gin Arg Asn Asn Glu Asn Arg Glu Asn 

165 170 175 

Ala Ser Ser Asn His Asp His Asp Asp Gly Ala Ser Gly Thr Pro Lys 
180 185 190 

Glu Lys Lys Ala Lys Thr Ser Lys Lys Lys Lys Arg Ser Lys Ala Lys 

195 200 205 

Ala Glu Arg Glu Ala Ser Pro Ala Asp Leu Pro He Asp Pro Asn Glu 
210 215 220 

Pro Thr Tyr Cys Leu Cys Asn Gin Val Ser Tyr Gly Glu Met lie Gly 
225 230 235 240 

Cys Asp Asn Asp Glu Cys Pro He Glu Trp Phe His Phe Ser Cys Val 
245 250 255 

Gly Leu Asn His Lys Pro Lys Gly Lys Trp Tyr Cys Pro Lys Cys Arg 
260 265 270 

Gly Glu Asn Glu Lys Thr Met Asp Lys Ala Leu Glu Lys Ser Lys Lys 
275 280 285 

Glu Arg Ala Tyr Asn Arg 
290 294 

<210> 20 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 20 

caaagcagag cctcccgaga ag 22 

<210> 21 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 21 

cctatgctgc tcttcgattc ttcc 24 

<210> 22 

<211> 4115 

<212> DNA 

<213> Homo sapiens 

<400> 22 
ctagtctata


cagcaacgac cctacatcgt 


cactctgggg tcttagaaag 


tccataaagc 


60 


tgcctcccgg 


gacaagtccg aagctggaga- 


gatg^caaa^ ggaagaagac 


atcaacctta 


120 


atatacaaga 


gcccagaaga gactgctcta 


actgggcctg 


gtcaatggcc 


tgaggaagta 


180 


gtaacalilzlic 


ggtagacaga agbgccagct 


gacgtccttg 


tggcgaacac 


ggacacctct 


240 


aatiaaaaacb 


tacaaligcca caggaggctt 


tgcaaatatt 


tgatagattc 


ggtgccgata 


300 


^aaatctcg^ 


ga'bgtg^a'tg caacaliggct: 


tccattatgc 


gtttatagtg 


gattttgtca 


360 


gtggtggcaa 


ac'tgc'fcg'bcc a'tggtgcag't 


atcgaagtgc 


caacaaggct 


gcctcacacc 


420 


acblititaclia 


cca^aacgaa agaagligagc 


aattgtggaa 


ttttgctgat 


aaaaatgcaa 


480 


Ci ^3 ^ 


gtiiaa^aagli taaatgcaca 


ccctcatgct 


gctgtatgtc 


tggatcatca 


540 


a <Tra t* a rrt* ^ CT 


catzgc^tctti agcaaaa^g^ 


gacgtctttg 


tgcagatata 


atcTQaataac 


600 






tcacattcat 


aacaaattat 


gaatatatac 


660 


rr a a a a i"t' a'tc 


aaaaatcatc aaataccaa^ 


cagaaggaac 


tctgcaggaa 


acctgatgag 


720 


rr f ^ n a f f t 


ggcggaaaga cacctgacac 


gctgaaagct 


ggtggaaaaa 


cacctgatga 


780 


ggctgcaccc 


"tggliggaaag acacctgaca 


ggctgaaagc 


tggtggaaaa 


acacctgatg 


840 




't'tcfCT'tcTa'a.cia aaca.'tc'toac 


aaattcaatg 


ttggagaaag 


gacatctgga 


900 




0t*r*Acrcaa'aa aadcacct&Q 


gaaattacga . 


tcctgcaaaa 


aaacatctga 


960 




rrrrr>r*afTr*aaa crfTaacra^cct& 


gaagatcgca gggagaaaaa 


gaagacacac 


1020 


C u y y dad u 


al'trarf^ocffr' aaaacia.3.a.ca. 


ctgagaaatt acgtgggcag 


aaaaggaaga 


1080 


rv/*^ an/va a 
CCtay gaayd 


r^rrr'A't'fTfmarr aaaAacrd3d.c 


% . ■ ■ t ; i i 1 
cctg^aaaga 


tggatgcgtg 


caagagtaac 


1140 


atctaataaa 


'f^^aaarrf"^^^ rraaaaafrrraa 
Cuaady i>L> yaaaaa^ijaa 


atctaagatg 


ttgcatgtcc 


acaaaagaat 


1200 


catctacaaa 


#Tr«aa rr^rTr*f*<a ^naf'oanacTCr 
y^Cda^uy^Cd i.(|d L>(^avja^y 


tcccatcaga 


tccaaacaag 


aaaagatgaa 


1260 


gaauau^Cbu 


ugduuwu^3(| y uv« wi^L-i>u«ja 


agttctgcaa 


gattcaagtg 


gtatacctga 


1320 


j«4* ^1- a 1- a 4- a ^ 

gUC^aCauai;. 


aaaaan^aaf* rfana^aaats. 
aadaciu kaa ua^ja^aQmwM 


agaagtagaa 


agcctcctaa 


aagccatctg 


1380 


cctli caagcc 


nr>r*a 1" Irca a a rrr*aaaaf^tct 


ttccaaataa 


gcctttgaat 


gaagaatgaa 


1440 


caaacattga 


arTr»aiTa^or*fT ^ fTt"t* Pf*caCC 


gaatccaaac 


aaaggactat 


aagaaaattc 


1500 


4« 4* tfv^Fv s 4* 4* 

u^gggai»uw^ 


afrarrt* f ^f^^fT rrafiao't'crttt 
dyd^^^w^lv^ ^|dy ai« w i> w 


acagaaggat 


tgtgtttacc 


aaggctacac 


1560 


o^^aa aa snsi 
aXtCciaciaay a 


a'han'a'taaaa aa.&'tcfcra.aaa 


tagaagagtc 


cctaataaag 


tggtcttctg 


1620 




r'rTrtaai'craaa tttctattcc 


actaaagcct/ 


agaattgaag 


acatgcaaac 


1680 


4'^4T*aaa fxt^n 


arrf*f*^nccrfTfi aa.occ&'tC'tQ 


cttcgagcct 


ccactgaaat 


caaaagtctg 


1740 


i* r« r^r*a a a i* a a 


gcct'tggaa't gaaaaaligaa 


aaacatggag 


gcagatgaga 


actcccatca 


1800 


rtaa^f f aaaf* 


aaaggac'ta't aagaaaattc 


tgggatactg 


gagtctctgt 


agactgtttc 


1860 


acagaaggat 


tgtgtttacc aaggctgcgc 


tcaaaaagaa 


tagataaaat 


aatggaaaat 


1920 


tagaagggtc 


cctgttaaag tggtcttctg 


aggctaactg 


ggaatgaaag 


ttctattcca 


1980 


actaaagcct 


agaattgatg acatgcaaac 


ttcaaagcag 


gcctcccgag 


agccatctgc 


2040 


cttcgagcct 


ccattgaaat caaaagtctg 


tccaaataaa 


ccttggaatt 


aagaatgaac 


2100 


aaacattgag 


gcagatgaga actcccatca 


aatccaaaca 


aaggactatg 


agaaagttct 


2160 
tgggattctg gagtctctgt agactgtttc

tcaaaaagaa tagataaaat aatggaaaat 

aggctccctg agaatgaaag ttctattcca 

ttcaaagcag gcctcccgag agccatctgc 

tccaaataaa ccttggaatt aagaatgaac 

aatcaaaaca aagaaggttg agaaaattct 

cagaaggatg gtgtgtaccc aggctacaca 

agaagattca ctagcctatc aaaatcttgg 

ttcaaaaaga cactgtgaac acgtacagga 

ctgaaaaaga actgtcagaa caaaagaaat 

ggaacaagag tctgcagtgt agattgactt 

atatattaaa gaaaaaatta ggaagaatta 

gaagtgaaac acaacttgaa aggctctcag 

taatttgaat aggtttctca actcatgaaa 

tgaaaaagga attgccatgc aaaactggaa 

gaaaataaat ctttgaggac ttaagatttt 

aaaactgaaa aggaatcatt actaaaaggg 

tagctgagaa acaatgctca ttctaaattg 

gaaattgaat acaccatcct gactggcttc 

aagaaaaagt aagaacctgc ttccacattg 

ttgatgtgag agtacgatat taacaatgag 

aaatccaaaa cctaaaaatt atctcaatta 

ttcagaacat cacaaagaga caacgtgaaa 

atcaaaacga caagataatg gaacaaacac 

tttcaactac aagcaaaaat tgtggcttca 

caacaaaagc agataacaat gatattcatt 

aagagaaaaa gaggagatat taattacaat 

aaagagaaag agaaacagaa actcatgaga 

acagaccaga ctttactcac actcatgcta 

atcttaccaa agtctgtgtc acagaatact 

aagcctacag cataaaataa agtgtfgaaga 

ggattcccat taccctgatg tgcagcagac 

tccagcctag tgacagagtg gactccacct 

<210> 23 
<211> 1341 
<212> PRT 

<213> Homo sapiens 
<400> 23 

Met Thr Lys Arg Lys Lys Thr lie \ 



cagaaggatg gtgtttaccc aggctacaca 2220 
agaagagtct ctgataatga ggttttctga 2280 
ctaaagcctt gaattgatgg catgcaaact 2340 
ttcgagcctg cattgaaatg aaaagtctgt 2400 
aacattgaga cagatcagat ttcccttcag 2460 
gggattctga agtctccgtg gactgtttca 2520 
caaaaagaaa ggataaaata gtggaaaatt 2580 
tacagttcat cttgtgaaag gcaagggaac 2640 
aaatggaaca atgaaaaaga. gttttgtgta 2700 
aaatcacagt agagaaccaa aagttaaatg 2760 
aaaccaagaa aagagaagag agaaatgccg 2820 
gaagaatcga gagcagcata gaaagagtta 2880 
atacaagata agaattgaag gtgtagaaag 2940 
tgaaaattat tcttacatga aattgcatgt 3000 
tagccacact aaacaccaat ccaggaaaag 3060 
aaagaaaaga tgctgaactt agatgaccct 3120 
atctcaatat gtgggcagct aaagttctga 3180 
aggaaaaaca gacaaagaaa actagaggca 3240 
gctgtacaag ccatgatcaa ttgtgacatc 3300 
aggagatgct gtttgcaaag aaaatgaatg 3360 
tgctccatca ccactttctg agctcaaagg 3420 
gcaggagatg tctaagagaa atacattggt 3480 
acagtgtcaa tgaaggaagc gaacacatgt 3540 
ctgaacagca gagtctctag tcagaaatta 3600 
cagcaattag tcatgcacat agaaagctga 3660 
tcttgagagg aaatgcaaca catctcctaa 3720 
accatttaaa aaccgtatat tcaatatgaa 3780 
acaagcagta gaaacttctt tggagaaaca 3840 
gaggccagtc tagcatcacc tatgttgaaa 3900 
attttagaag aaaattcatg tttcttcctg 3960 
ttacttgttc cgaattgcat aagctgcaca 4020 
tcattcaatc aaccagaatc cgctctgcac 4080 
ggaaa 4115 



Leu Asn lie Gin Asp Ala Gin 
1 5 10 15

Lys Acq Thr Ala Leu His Trp Ala Cys Val Asn Gly His Glu Glu Val 
20 25 30 

Val Thr Phe Leu Val Asp Arg Lys Cys Gin Leu Asp Val Leu Asp Gly 
35 40 45 

Glu His Arg Thr Pro Leu Net Lys Ala Leu Gin Cys His Gin Glu Ala 

50 55 60 

Cys Ala Asn lie Leu lie Asp Ser Gly Ala Asp He Asn Leu Val Asp 
65 70 75 80 

Val Tyr Gly Asn Met Ala Leu His Tyr Ala Val Tyr Ser Glu He Leu 
85 90 95 

Ser Val Val Ala Lys Leu Leu Ser His Gly Ala Val He Glu Val His.. 
100 105 110 

Asn Lys Ala Ser Leu Thr Pro Leu Leu Leu Ser He Thr Lys Arg Ser 

115 120 125 

Glu Gin He Val Glu Phe Leu Leu He Lys Asn Ala Asn Ala Asn Ala 
130 135 140 

Val Asn Lys Tyr Lys Cys Thr Ala Leu Met Leu Ala Val Cys His Gly 
145 150 155 160 

Ser Ser Glu He Val Gly Met Leu Leu Gin Gin Asn Val Asp Val Phe 
165 170 175 

Ala Ala Asp He Cys Gly Val Thr Ala Glu His Tyr Ala Val Thr Cys 
180 185 190 

Gly Phe His His He His Glu Gin He Met Glu Tyr He Arg Lys Leu 
195 200 205 

Ser Lys Asn His Gin Asn Thr Asn Pro Glu Gly Thr Ser Ala Gly Thr 
210 215 220 

Pro Asp Glu Ala Ala Pro Leu Ala Glu Arg Thr Pro Asp Thr Ala Glu 
225 230 235 240 

Ser Leu Val Glu Lys Thr Pro Asp Glu Ala Ala Pro Leu Val Glu Arg 
245 250 255 

Thr Pro Asp Thr Ala Glu Ser Leu Val Glu Lys Thr Pro Asp Glu Ala 
260 265 270 

Ala Ser Leu Val Glu Gly Thr Ser Asp Lys He Gin Cys Leu Glu Lys 
275 280 285 

Ala Thr Ser Gly Lys Phe Glu Gin Ser Ala Glu Glu Thr Pro Arg Glu 

290 295 300 

He Thr Ser Pro Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp Pro Ala 
305 310 315 320 

Lys Gly Arg Pro Arg Lys He Ala Trp Glu Lys Lys Glu Asp Thr Pro 

325 330 335 

Arg Glu He Met Ser Pro Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp 
340 345 350 

Ala Ala Lys Gly Arg Pro Arg Lys He Ala Trp Glu Lys Lys Glu Thr 
355 360 365 

Pro Val Lys Thr Gly Cys Val Ala Arg Val Thr Ser Asn Lys Thr Lys 
370 375 380 

Val Leu Glu Lys Gly Arg Ser Lys Met He Ala Cys Pro Thr Lys Glu 
385 390 395 400



Ser Ser Thr Lys Ala Ser Ala Asn Asp Gin Arg Phe Pro Ser Glu Ser 
405 410 415 



Lys Gin Glu Glu Asp Glu Glu Tyr Ser Cys Asp Ser Arg Ser Leu Phe 
420 425 430 

Glu Ser Ser Ala Lys lie Gin Val Cys lie Pro Glu Ser He Tyr Gin 
435 440 445 

Lys Val Met Glu He Asn Arg Glu Val Glu Glu Pro Pro Lys Lys Pro 
450 455 460 

Ser Ala Phe Lys Pro Ala He Glu Met Gin Asn Ser Val Pro Asn Lys 
465 470 475 480 

Ala Phe Glu Leu Lys Asn Glu Gin Thr Leu Arg Ala Asp Pro Met Phe 
485 490 495 



Pro Pro Glu Ser Lys Gin Lys Asp Tyr Glu Glu Asn Ser Trp Asp Ser 
500 505 510 

Glu Ser Leu Cys Glu Thr Val Ser Gin Lys Asp Val Cys Leu Pro Lys 
515 520 525 

Ala Thr His Gin Lys Glu He Asp Lys He Asn Gly Lys Leu Glu Glu 
530 535 540 

Ser Pro Asn Lys Asp Gly Leu Leu Lys Ala Thr Cys Gly Met Lys Val 
545 550 555 560 

Ser Ile Pro Thr Lys Ala Leu Glu Leu Lys Asp Met Gln Thr Phe Lys
565 570 575

Ala Glu Pro Pro Gly Lys Pro Ser Ala Phe Glu Pro Ala Thr Glu Met
580 585 590

Gln Lys Ser Val Pro Asn Lys Ala Leu Glu Leu Lys Asn Glu Gln Thr
595 600 605

Trp Arg Ala Asp Glu Ile Leu Pro Ser Glu Ser Lys Gln Lys Asp Tyr
610 615 620

Glu Glu Asn Ser Trp Asp Thr Glu Ser Leu Cys Glu Thr Val Ser Gln
625 630 635 640

Lys Asp Val Cys Leu Pro Lys Ala Ala His Gln Lys Glu Ile Asp Lys
645 650 655

Ile Asn Gly Lys Leu Glu Gly Ser Pro Val Lys Asp Gly Leu Leu Lys
660 665 670

Ala Asn Cys Gly Met Lys Val Ser Ile Pro Thr Lys Ala Leu Glu Leu
675 680 685

Met Asp Met Gln Thr Phe Lys Ala Glu Pro Pro Glu Lys Pro Ser Ala
690 695 700

Phe Glu Pro Ala Ile Glu Met Gln Lys Ser Val Pro Asn Lys Ala Leu
705 710 715 720

Glu Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Glu Ile Leu Pro Ser
725 730 735

Glu Ser Lys Gln Lys Asp Tyr Glu Glu Ser Ser Trp Asp Ser Glu Ser
740 745 750

Leu Cys Glu Thr Val Ser Gln Lys Asp Val Cys Leu Pro Lys Ala Thr
755 760 765

His Gln Lys Glu Ile Asp Lys Ile Asn Gly Lys Leu Glu Glu Ser Pro
770 775 780

Asp Asn Asp Gly Phe Leu Lys Ala Pro Cys Arg Met Lys Val Ser Ile
785 790 795 800

Pro Thr Lys Ala Leu Glu Leu Met Asp Met Gln Thr Phe Lys Ala Glu
805 810 815

Pro Pro Glu Lys Pro Ser Ala Phe Glu Pro Ala Ile Glu Met Gln Lys
820 825 830

Ser Val Pro Asn Lys Ala Leu Glu Leu Lys Asn Glu Gln Thr Leu Arg
835 840 845

Ala Asp Gln Met Phe Pro Ser Glu Ser Lys Gln Lys Lys Val Glu Glu
850 855 860

Asn Ser Trp Asp Ser Glu Ser Leu Arg Glu Thr Val Ser Gln Lys Asp
865 870 875 880

Val Cys Val Pro Lys Ala Thr His Gln Lys Glu Met Asp Lys Ile Ser
885 890 895

Gly Lys Leu Glu Asp Ser Thr Ser Leu Ser Lys Ile Leu Asp Thr Val
900 905 910

His Ser Cys Glu Arg Ala Arg Glu Leu Gln Lys Asp His Cys Glu Gln
915 920 925

Arg Thr Gly Lys Met Glu Gln Met Lys Lys Lys Phe Cys Val Leu Lys
930 935 940

Lys Lys Leu Ser Glu Ala Lys Glu Ile Lys Ser Gln Leu Glu Asn Gln
945 950 955 960

Lys Val Lys Trp Glu Gln Glu Leu Cys Ser Val Arg Leu Thr Leu Asn
965 970 975

Gln Glu Glu Glu Lys Arg Arg Asn Ala Asp Ile Leu Asn Glu Lys Ile
980 985 990 

Arg Glu Glu Leu Gly Arg Ile Glu Glu Gln His Arg Lys Glu Leu Glu
995 1000 1005

Val Lys Gln Gln Leu Glu Gln Ala Leu Arg Ile Gln Asp Ile Glu Leu
1010 1015 1020

Lys Ser Val Glu Ser Asn Leu Asn Gln Val Ser His Thr His Glu Asn
1025 1030 1035 1040

Glu Asn Tyr Leu Leu His Glu Asn Cys Met Leu Lys Lys Glu Ile Ala
1045 1050 1055

Met Leu Lys Leu Glu Ile Ala Thr Leu Lys His Gln Tyr Gln Glu Lys
1060 1065 1070

Glu Asn Lys Tyr Phe Glu Asp Ile Lys Ile Leu Lys Glu Lys Asn Ala
1075 1080 1085

Glu Leu Gln Met Thr Leu Lys Leu Lys Glu Glu Ser Leu Thr Lys Arg
1090 1095 1100

Ala Ser Gln Tyr Ser Gly Gln Leu Lys Val Leu Ile Ala Glu Asn Thr
1105 1110 1115 1120

Met Leu Thr Ser Lys Leu Lys Glu Lys Gln Asp Lys Glu Ile Leu Glu
1125 1130 1135

Ala Glu Ile Glu Ser His His Pro Arg Leu Ala Ser Ala Val Gln Asp
1140 1145 1150

His Asp Gln Ile Val Thr Ser Arg Lys Ser Gln Glu Pro Ala Phe His
1155 1160 1165



Ile Ala Gly Asp Ala Cys Leu Gln Arg Lys Met Asn Val Asp Val Ser
1170 1175 1180

Ser Thr Ile Tyr Asn Asn Glu Val Leu His Gln Pro Leu Ser Glu Ala
1185 1190 1195 1200

Gln Arg Lys Ser Lys Ser Leu Lys Ile Asn Leu Asn Tyr Ala Gly Asp
1205 1210 1215

Ala Leu Arg Glu Asn Thr Leu Val Ser Glu His Ala Gln Arg Asp Gln
1220 1225 1230

Arg Glu Thr Gln Cys Gln Met Lys Glu Ala Glu His Met Tyr Gln Asn
1235 1240 1245

Glu Gln Asp Asn Val Asn Lys His Thr Glu Gln Gln Glu Ser Leu Asp
1250 1255 1260

Gln Lys Leu Phe Gln Leu Gln Ser Lys Asn Met Trp Leu Gln Gln Gln
1265 1270 1275 1280

Leu Val His Ala His Lys Lys Ala Asp Asn Lys Ser Lys Ile Thr Ile
1285 1290 1295

Asp Ile His Phe Leu Glu Arg Lys Met Gln His His Leu Leu Lys Glu
1300 1305 1310

Lys Asn Glu Glu Ile Phe Asn Tyr Asn Asn His Leu Lys Asn Arg Ile
1315 1320 1325

Tyr Gln Tyr Glu Lys Glu Lys Ala Glu Thr Glu Asn Ser
1330 1335 1340 



<210> 24 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 24 

aatgggaaca agagctctgc ag 22 

<210> 25 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 25 

gggtcatctg aagttcagca ttc 23 

<210> 26 

<211> 3673 

<212> DNA 

<213> Homo sapiens 

<221> CDS 

<222> 439, 473, 1789 
<400> 26 

caagagcttg gcgatacaga aatttctgct ggtgttgggg cgggtgcggg aactgaagac 60 
gggcgagtgc gagccggggg cgggtgctgg ggaagggtaa gcgggaagcg agggcgaggg 120 
gtaggggctg gggaagggcg agcgggaggc gcgggctctc tctagcaggg ggctgcagcc 180 
atgaagaggc tcttagctgc cgctggcaag ggcgtgcggg gcccggagcc cccgaacccc 240 
ttcagcgaac gggtctacac tgagaaggac tacgggacca tctacttcgg ggatctaggg 300 
aagatccata cagctgcctc ccggggccaa gtccagaagc tggagaagat gacagtaggg 360 
aagaagcccg tcaacctgaa caaaagagat atgaagaaga ggactgctct acactgggcc 420 
tgtgtcaatg gccatgcana agtagtaaca tttctggtag acagaaagtg ccngcttaat 480
gtccttgatg gcgaagggag gacacctctg atgaaggctc tacaatgcga gagggaagct 540
ttgtgcaaat attctcatag atgctggtgc tgatctaaat tatgtagatg tgtatggcaa 600
cacggctctc cattatgccg tttatagtga gaatttatta atggtggcaa cactgctgtc 660
ctatggtgca gtcatcgagg tgcaaaacaa ggctagcctc acaccccttt tactggccat 720
acagaaaaga agcaagcaaa ctgtggaatt tttactaaca aaaaatgcaa atgcaaacgc 780
atttaatgag tctaaatgca cagccctcat gcttgccata tgtgaaggct catcagagat 840
agtcggcatg cttcttcagc aaaatgttga cgtctttgct gaagacatac atggaataac 900
tgcagaacgt tatgctgctg ctcgtggagt taattacatt catcaaccuic ttttggaaca 960
tatacgaaaa ttacctaaaa atcctcaaaa taccaatcca gaaggaacat ctacaggaac 1020
acctgatgag gctgcaccct tggcggaaag aacacctgac acggctgaaa gcttgctgga 1080
aaaaacacct gacgaggctg cacgcttggt ggagggaacg tctgccaaaa ttcaatgtct 1140
ggggaaagca acatctggaa agtttgaaca gtcaacagaa gaaacaccta ggaaaatttt 1200
gaggcctaca aaagaaacat ctgagaaatt ttcatggcca gcaaaagaaa gatctaggaa 1260
gatcacatgg gaggaaaaag aaacatctgt aaagactgaa tgcgtggcag gagtaacacc 1320
taataaaact gaagttttgg aaaaaggaac atctaatatg attgcatgtc ctacaaaaga 1380
aacatctaca aaagcaagta caaatgtgga tgtgagttct gtagagccta tattcagtct 1440
ttttggcaca cggactattg aaaattcaca gtgtacaaaa gttgaggaag actttaatct 1500
tgctaccaag attatctcta agagtgctgc acagaattat acgtgtttac ctgatgctac 1560
atatcaaaaa gatatcaaaa caataaatca caaaatagaa gatcagatgt tcccatcaga 1620
atccaaacga gaggaagatg aagaatattc ttgggattct gggagtctct ttgagagttc 1680
tgcaaagact caagtgtgta tacctgagtc tatgtatcag aaagtaatgg agataaatag 1740
agaagtagaa gagcttcctg agaagccatc tgccttcaag cctgccgtng aaatgcaaaa 1800
gactgttcca aataaagcct ttgaattgaa gaatgaacaa acattgagag cagctcagat 1860
gttcccatca gaatccaaac aaaaggacga tgaagaaaat tcttgggatt ctgagagtcc 1920
ctgtgagacg gtttcacaga aggatgtgta tttacccaaa gctacacatc aaaaagaatt 1980
cgatacctta agtggaaaat tagaagagtc tcctgttaaa gatggtcttc tgaagcctac 2040
ctgtggaagg aaagtttctc ttccaaataa agccttagaa ttaaaggaca gagaaacatt 2100
caaagcagag tctcctgata aagatggtct tctgaagcct acctgtggaa ggaaagtttc 2160
tcttccaaat aaagccttag aattaaagga cagagaaaca ctcaaagcag agtctcctga 2220
taatgatggt cttctgaagc ctacctgtgg aaggaaagtt tctcttccaa ataaagcttt 2280
agaattgaag gacagagaaa cattcaaagc agctcagatg ttcccatcag aatccaaaca 2340
aaaggatgat gaagaaaatt cttgggattt tgagagtttc cttgagactc tcttacagaa 2400
tgatgtgtgt ttacccaagg ctacacatca aaaagaattc gataccttaa gtggaaaatt 2460
agaagagtct cctgataaag atggtcttct gaagcctacc tgtggaatga aaatttctct 2520
tccaaataaa gccttagaat tgaaggacag agaaacattc aaagcagagg atgtgagttc 2580
tgtagagtcc acattcagtc tttttggcaa accgactact gaaaattcac agtctacaaa 2640
agttgaggaa gactttaatc ttactaccaa ggagggagca acaaagacag taactggaca 2700
acaggaacgt gatattggca ttattgaacg agctccacaa gatcaaacaa ataagatgcc 2760
cacatcagaa ttaggaagaa aagaagatac aaaatcaact tcagattctg agattatctc 2820


tgtgagtgat acacagaatt atgagtgttt acctgaggct acatatcaaa aagaaataaa 2880
gacaacaaat ggcaaaatag aagagtctcc tgaaaagcct tctcactttg agcctgccac 2940
tgaaatgcaa aactctgttc caaataaagg cttagaatgg aagaataaac aaacattgag 3000


agcagattca actaccctat caaaaatctt ggatgcactt ccttcttgtg aaagaggaag 3060
ggaacttaaa aaagataact gtgaacaaat tacagcaaaa atggaacaaa tgaaaaataa 3120


ottttatata 


ciiacaaaaaci 


aactgtcaga 


agcgaaagaa a'taaaatzcac 


agttagagaa 


3180 




aaaiiCTdoaac 


a.a.cTa crct ciicr 


caortaticTacTa. tlicTCC^tfcaa 


atcaagaaga 


3240 




aciaaa.'kcThca' 

ouuaw wu w\^«j 


a'ta'ba'fcbaaa 


agaaaaaatli agacccgaag 


agcaacttag 


3300 




cTaa.crtcTaa.a.c 


accaact'tga 


acagackcbc agaa'tacaag 


atatagaatt 


3360 




acaag^aa^t 


'tgaatcagg't 


t'bctcacact: catgaaagtg 


aaaatga'tc't 


3420 


ctttcatgaa aattgcatgt tgaaaaagga aattgccatg ctaaaactgg aagtagccac 3480
actgaaacat caacaccagg tgaaggaaaa taaatacttt gaggacatta agattttaca 3540
agaaaagaat gctgaacttc aaatgaccct aaaactgaaa cagaaaacag taacaaaaag 3600
ggcatctcag tatagagagc agcttaaagt tctgacggca gagaacacga tgctgacttc 3660
taaattgaag gaa 3673



<210> 27 

<211> 1011 

<212> PRT 

<213> Homo sapiens 

<400> 27 

Met Val Ala Thr Leu Leu Ser Tyr Gly Ala Val Ile Glu Val Gln Asn
1 5 10 15

Lys Ala Ser Leu Thr Pro Leu Leu Leu Ala Ile Gln Lys Arg Ser Lys
20 25 30

Gln Thr Val Glu Phe Leu Leu Thr Lys Asn Ala Asn Ala Asn Ala Phe
35 40 45

Asn Glu Ser Lys Cys Thr Ala Leu Met Leu Ala Ile Cys Glu Gly Ser
50 55 60

Ser Glu Ile Val Gly Met Leu Leu Gln Gln Asn Val Asp Val Phe Ala
65 70 75 80

Glu Asp Ile His Gly Ile Thr Ala Glu Arg Tyr Ala Ala Ala Arg Gly
85 90 95

Val Asn Tyr Ile His Gln Gln Leu Leu Glu His Ile Arg Lys Leu Pro
100 105 110

Lys Asn Pro Gln Asn Thr Asn Pro Glu Gly Thr Ser Thr Gly Thr Pro
115 120 125

Asp Glu Ala Ala Pro Leu Ala Glu Arg Thr Pro Asp Thr Ala Glu Ser
130 135 140

Leu Leu Glu Lys Thr Pro Asp Glu Ala Ala Arg Leu Val Glu Gly Thr
145 150 155 160

Ser Ala Lys Ile Gln Cys Leu Gly Lys Ala Thr Ser Gly Lys Phe Glu
165 170 175

Gln Ser Thr Glu Glu Thr Pro Arg Lys Ile Leu Arg Pro Thr Lys Glu
180 185 190

Thr Ser Glu Lys Phe Ser Trp Pro Ala Lys Glu Arg Ser Arg Lys Ile
195 200 205

Thr Trp Glu Glu Lys Glu Thr Ser Val Lys Thr Glu Cys Val Ala Gly
210 215 220

Val Thr Pro Asn Lys Thr Glu Val Leu Glu Lys Gly Thr Ser Asn Met
225 230 235 240

Ile Ala Cys Pro Thr Lys Glu Thr Ser Thr Lys Ala Ser Thr Asn Val
245 250 255

Asp Val Ser Ser Val Glu Pro Ile Phe Ser Leu Phe Gly Thr Arg Thr
260 265 270

Ile Glu Asn Ser Gln Cys Thr Lys Val Glu Glu Asp Phe Asn Leu Ala
275 280 285

Thr Lys Ile Ile Ser Lys Ser Ala Ala Gln Asn Tyr Thr Cys Leu Pro
290 295 300

Asp Ala Thr Tyr Gln Lys Asp Ile Lys Thr Ile Asn His Lys Ile Glu
305 310 315 320

Asp Gln Met Phe Pro Ser Glu Ser Lys Arg Glu Glu Asp Glu Glu Tyr
325 330 335



Ser Trp Asp Ser Gly Ser Leu Phe Glu Ser Ser Ala Lys Thr Gln Val
340 345 350

Cys Ile Pro Glu Ser Met Tyr Gln Lys Val Met Glu Ile Asn Arg Glu
355 360 365

Val Glu Glu Leu Pro Glu Lys Pro Ser Ala Phe Lys Pro Ala Val Glu
370 375 380

Met Gln Lys Thr Val Pro Asn Lys Ala Phe Glu Leu Lys Asn Glu Gln
385 390 395 400

Thr Leu Arg Ala Ala Gln Met Phe Pro Ser Glu Ser Lys Gln Lys Asp
405 410 415

Asp Glu Glu Asn Ser Trp Asp Ser Glu Ser Pro Cys Glu Thr Val Ser
420 425 430

Gln Lys Asp Val Tyr Leu Pro Lys Ala Thr His Gln Lys Glu Phe Asp
435 440 445

Thr Leu Ser Gly Lys Leu Glu Glu Ser Pro Val Lys Asp Gly Leu Leu
450 455 460

Lys Pro Thr Cys Gly Arg Lys Val Ser Leu Pro Asn Lys Ala Leu Glu
465 470 475 480

Leu Lys Asp Arg Glu Thr Phe Lys Ala Glu Ser Pro Asp Lys Asp Gly
485 490 495

Leu Leu Lys Pro Thr Cys Gly Arg Lys Val Ser Leu Pro Asn Lys Ala
500 505 510

Leu Glu Leu Lys Asp Arg Glu Thr Leu Lys Ala Glu Ser Pro Asp Asn
515 520 525

Asp Gly Leu Leu Lys Pro Thr Cys Gly Arg Lys Val Ser Leu Pro Asn
530 535 540



Lys Ala Leu Glu Leu Lys Asp Arg Glu Thr Phe Lys Ala Ala Gln Met
545 550 555 560

Phe Pro Ser Glu Ser Lys Gln Lys Asp Asp Glu Glu Asn Ser Trp Asp
565 570 575

Phe Glu Ser Phe Leu Glu Thr Leu Leu Gln Asn Asp Val Cys Leu Pro
580 585 590

Lys Ala Thr His Gln Lys Glu Phe Asp Thr Leu Ser Gly Lys Leu Glu
595 600 605

Glu Ser Pro Asp Lys Asp Gly Leu Leu Lys Pro Thr Cys Gly Met Lys
610 615 620

Ile Ser Leu Pro Asn Lys Ala Leu Glu Leu Lys Asp Arg Glu Thr Phe
625 630 635 640

Lys Ala Glu Asp Val Ser Ser Val Glu Ser Thr Phe Ser Leu Phe Gly
645 650 655

Lys Pro Thr Thr Glu Asn Ser Gln Ser Thr Lys Val Glu Glu Asp Phe
660 665 670

Asn Leu Thr Thr Lys Glu Gly Ala Thr Lys Thr Val Thr Gly Gln Gln
675 680 685

Glu Arg Asp Ile Gly Ile Ile Glu Arg Ala Pro Gln Asp Gln Thr Asn
690 695 700

Lys Met Pro Thr Ser Glu Leu Gly Arg Lys Glu Asp Thr Lys Ser Thr
705 710 715 720

Ser Asp Ser Glu Ile Ile Ser Val Ser Asp Thr Gln Asn Tyr Glu Cys
725 730 735

Leu Pro Glu Ala Thr Tyr Gln Lys Glu Ile Lys Thr Thr Asn Gly Lys
740 745 750

Ile Glu Glu Ser Pro Glu Lys Pro Ser His Phe Glu Pro Ala Thr Glu
755 760 765

Met Gln Asn Ser Val Pro Asn Lys Gly Leu Glu Trp Lys Asn Lys Gln
770 775 780

Thr Leu Arg Ala Asp Ser Thr Thr Leu Ser Lys Ile Leu Asp Ala Leu
785 790 795 800

Pro Ser Cys Glu Arg Gly Arg Glu Leu Lys Lys Asp Asn Cys Glu Gln
805 810 815

Ile Thr Ala Lys Met Glu Gln Met Lys Asn Lys Phe Cys Val Leu Gln
820 825 830

Lys Glu Leu Ser Glu Ala Lys Glu Ile Lys Ser Gln Leu Glu Asn Gln
835 840 845

Lys Ala Lys Trp Glu Gln Glu Leu Cys Ser Val Arg Leu Pro Leu Asn
850 855 860

Gln Glu Glu Glu Lys Arg Arg Asn Val Asp Ile Leu Lys Glu Lys Ile
865 870 875 880

Arg Pro Glu Glu Gln Leu Arg Lys Lys Leu Glu Val Lys His Gln Leu
885 890 895

Glu Gln Thr Leu Arg Ile Gln Asp Ile Glu Leu Lys Ser Val Thr Ser
900 905 910

Asn Leu Asn Gln Val Ser His Thr His Glu Ser Glu Asn Asp Leu Phe
915 920 925

His Glu Asn Cys Met Leu Lys Lys Glu Ile Ala Met Leu Lys Leu Glu
930 935 940

Val Ala Thr Leu Lys His Gln His Gln Val Lys Glu Asn Lys Tyr Phe
945 950 955 960

Glu Asp Ile Lys Ile Leu Gln Glu Lys Asn Ala Glu Leu Gln Met Thr
965 970 975

Leu Lys Leu Lys Gln Lys Thr Val Thr Lys Arg Ala Ser Gln Tyr Arg
980 985 990

Glu Gln Leu Lys Val Leu Thr Ala Glu Asn Thr Met Leu Thr Ser Lys
995 1000 1005

Leu Lys Glu 
1010 

<210> 28 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 28 

tctcatagat gctggtgctg atc 23

<210> 29 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 29 

cccagacatt gaattttggc agac 24 

<210> 30 

<211> 56 

<212> PRT 

<213> Homo sapiens

<400> 30 

Met Glu Glu Ile Ser Ala Ala Ala Val Lys Val Val Pro Gly Pro Glu
1 5 10 15

Arg Pro Ser Pro Phe Ser Gln Leu Val Tyr Thr Ser Asn Asp Ser Tyr
20 25 30

Ile Val His Ser Gly Asp Leu Arg Lys Ile His Lys Ala Ala Ser Arg
35 40 45

Gly Gln Val Arg Lys Leu Glu Lys
50 55